Overview

Dataset statistics

Number of variables4
Number of observations299
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory9.5 KiB
Average record size in memory32.4 B

Variable types

Text3
Categorical1

Dataset

Description한국산업단지공단 대표 홈페이지(KICOX) 내 메뉴경로 안내입니다. 메뉴명, 최상위 메뉴, 부모메뉴, 메뉴경로 데이터를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15117612/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 15:43:39.402957
Analysis finished2023-12-12 15:43:40.000958
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct280
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-13T00:43:40.190614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length7.7959866
Min length2

Characters and Unicode

Total characters2331
Distinct characters310
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique262 ?
Unique (%)87.6%

Sample

1st row신규 산업단지 기획 공급
2nd row청렴도 · 부패방지 시책평가 결과
3rd rowCluster
4th row입주기업 저탄소 전환 지원
5th row입주기업 저탄소 전환 지원
ValueCountFrequency (%)
산업단지 16
 
3.0%
13
 
2.4%
개인정보처리방침 7
 
1.3%
지원 7
 
1.3%
디지털 6
 
1.1%
조성 5
 
0.9%
추진체계 5
 
0.9%
정보공개 4
 
0.7%
공공데이터 4
 
0.7%
입주기업 4
 
0.7%
Other values (371) 468
86.8%
2023-12-13T00:43:40.660566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
240
 
10.3%
82
 
3.5%
78
 
3.3%
55
 
2.4%
55
 
2.4%
49
 
2.1%
48
 
2.1%
44
 
1.9%
39
 
1.7%
36
 
1.5%
Other values (300) 1605
68.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1944
83.4%
Space Separator 240
 
10.3%
Uppercase Letter 68
 
2.9%
Decimal Number 39
 
1.7%
Other Punctuation 15
 
0.6%
Lowercase Letter 15
 
0.6%
Close Punctuation 4
 
0.2%
Open Punctuation 4
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
4.2%
78
 
4.0%
55
 
2.8%
55
 
2.8%
49
 
2.5%
48
 
2.5%
44
 
2.3%
39
 
2.0%
36
 
1.9%
34
 
1.7%
Other values (259) 1424
73.3%
Uppercase Letter
ValueCountFrequency (%)
O 9
13.2%
C 9
13.2%
E 8
11.8%
I 6
8.8%
S 6
8.8%
D 5
7.4%
R 4
 
5.9%
A 3
 
4.4%
G 3
 
4.4%
K 3
 
4.4%
Other values (7) 12
17.6%
Decimal Number
ValueCountFrequency (%)
2 11
28.2%
0 9
23.1%
1 8
20.5%
4 4
 
10.3%
6 2
 
5.1%
3 2
 
5.1%
5 2
 
5.1%
7 1
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
20.0%
s 3
20.0%
n 2
13.3%
r 2
13.3%
u 2
13.3%
d 1
 
6.7%
l 1
 
6.7%
t 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
& 6
40.0%
· 5
33.3%
/ 3
20.0%
, 1
 
6.7%
Space Separator
ValueCountFrequency (%)
240
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1944
83.4%
Common 304
 
13.0%
Latin 83
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
4.2%
78
 
4.0%
55
 
2.8%
55
 
2.8%
49
 
2.5%
48
 
2.5%
44
 
2.3%
39
 
2.0%
36
 
1.9%
34
 
1.7%
Other values (259) 1424
73.3%
Latin
ValueCountFrequency (%)
O 9
 
10.8%
C 9
 
10.8%
E 8
 
9.6%
I 6
 
7.2%
S 6
 
7.2%
D 5
 
6.0%
R 4
 
4.8%
A 3
 
3.6%
G 3
 
3.6%
K 3
 
3.6%
Other values (15) 27
32.5%
Common
ValueCountFrequency (%)
240
78.9%
2 11
 
3.6%
0 9
 
3.0%
1 8
 
2.6%
& 6
 
2.0%
· 5
 
1.6%
) 4
 
1.3%
( 4
 
1.3%
4 4
 
1.3%
/ 3
 
1.0%
Other values (6) 10
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1944
83.4%
ASCII 382
 
16.4%
None 5
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
240
62.8%
2 11
 
2.9%
O 9
 
2.4%
C 9
 
2.4%
0 9
 
2.4%
1 8
 
2.1%
E 8
 
2.1%
I 6
 
1.6%
S 6
 
1.6%
& 6
 
1.6%
Other values (30) 70
 
18.3%
Hangul
ValueCountFrequency (%)
82
 
4.2%
78
 
4.0%
55
 
2.8%
55
 
2.8%
49
 
2.5%
48
 
2.5%
44
 
2.3%
39
 
2.0%
36
 
1.9%
34
 
1.7%
Other values (259) 1424
73.3%
None
ValueCountFrequency (%)
· 5
100.0%

최상위메뉴
Categorical

Distinct16
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
주요사업
69 
열린경영
62 
정보공개
43 
고객마당
31 
고객서비스
30 
Other values (11)
64 

Length

Max length18
Median length4
Mean length4.7190635
Min length3

Unique

Unique6 ?
Unique (%)2.0%

Sample

1st row주요사업
2nd row청렴도 · 부패방지 시책평가 결과
3rd row주요사업
4th row주요사업
5th row주요사업

Common Values

ValueCountFrequency (%)
주요사업 69
23.1%
열린경영 62
20.7%
정보공개 43
14.4%
고객마당 31
10.4%
고객서비스 30
10.0%
글로벌 선도기업(구) 18
 
6.0%
공단소개 17
 
5.7%
미사용 11
 
3.7%
개인정보처리방침 10
 
3.3%
기업애로 해결 2
 
0.7%
Other values (6) 6
 
2.0%

Length

2023-12-13T00:43:40.858606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주요사업 69
21.4%
열린경영 62
19.2%
정보공개 43
13.3%
고객마당 31
9.6%
고객서비스 30
9.3%
글로벌 18
 
5.6%
선도기업(구 18
 
5.6%
공단소개 17
 
5.3%
미사용 11
 
3.4%
개인정보처리방침 10
 
3.1%
Other values (12) 14
 
4.3%
Distinct79
Distinct (%)26.4%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-13T00:43:41.140494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length17
Mean length7.1772575
Min length2

Characters and Unicode

Total characters2146
Distinct characters170
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)5.4%

Sample

1st row신 산업입지 조성 공급
2nd rowHOME
3rd row산업집적지경쟁력강화사업
4th row디지털 · 그린 산업환경 조성
5th row디지털 · 그린 산업환경 조성
ValueCountFrequency (%)
조성 20
 
4.0%
윤리경영 15
 
3.0%
디지털 13
 
2.6%
그린 13
 
2.6%
산업환경 13
 
2.6%
home 13
 
2.6%
정보공개 13
 
2.6%
글로벌 13
 
2.6%
· 13
 
2.6%
산업단지 12
 
2.4%
Other values (101) 360
72.3%
2023-12-13T00:43:41.693838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
9.3%
89
 
4.1%
77
 
3.6%
72
 
3.4%
56
 
2.6%
53
 
2.5%
53
 
2.5%
48
 
2.2%
47
 
2.2%
44
 
2.1%
Other values (160) 1408
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1796
83.7%
Space Separator 199
 
9.3%
Uppercase Letter 113
 
5.3%
Other Punctuation 23
 
1.1%
Close Punctuation 5
 
0.2%
Open Punctuation 5
 
0.2%
Lowercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
5.0%
77
 
4.3%
72
 
4.0%
56
 
3.1%
53
 
3.0%
53
 
3.0%
48
 
2.7%
47
 
2.6%
44
 
2.4%
42
 
2.3%
Other values (136) 1215
67.7%
Uppercase Letter
ValueCountFrequency (%)
O 22
19.5%
E 16
14.2%
C 14
12.4%
H 13
11.5%
M 13
11.5%
I 9
8.0%
X 8
 
7.1%
K 8
 
7.1%
P 4
 
3.5%
G 2
 
1.8%
Other values (3) 4
 
3.5%
Other Punctuation
ValueCountFrequency (%)
· 13
56.5%
, 5
 
21.7%
/ 4
 
17.4%
& 1
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
m 2
40.0%
y 1
20.0%
d 1
20.0%
u 1
20.0%
Space Separator
ValueCountFrequency (%)
199
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1796
83.7%
Common 232
 
10.8%
Latin 118
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
5.0%
77
 
4.3%
72
 
4.0%
56
 
3.1%
53
 
3.0%
53
 
3.0%
48
 
2.7%
47
 
2.6%
44
 
2.4%
42
 
2.3%
Other values (136) 1215
67.7%
Latin
ValueCountFrequency (%)
O 22
18.6%
E 16
13.6%
C 14
11.9%
H 13
11.0%
M 13
11.0%
I 9
7.6%
X 8
 
6.8%
K 8
 
6.8%
P 4
 
3.4%
m 2
 
1.7%
Other values (7) 9
7.6%
Common
ValueCountFrequency (%)
199
85.8%
· 13
 
5.6%
) 5
 
2.2%
( 5
 
2.2%
, 5
 
2.2%
/ 4
 
1.7%
& 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1796
83.7%
ASCII 337
 
15.7%
None 13
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
199
59.1%
O 22
 
6.5%
E 16
 
4.7%
C 14
 
4.2%
H 13
 
3.9%
M 13
 
3.9%
I 9
 
2.7%
X 8
 
2.4%
K 8
 
2.4%
) 5
 
1.5%
Other values (13) 30
 
8.9%
Hangul
ValueCountFrequency (%)
89
 
5.0%
77
 
4.3%
72
 
4.0%
56
 
3.1%
53
 
3.0%
53
 
3.0%
48
 
2.7%
47
 
2.6%
44
 
2.4%
42
 
2.3%
Other values (136) 1215
67.7%
None
ValueCountFrequency (%)
· 13
100.0%
Distinct298
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-13T00:43:41.991687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length51
Mean length34.715719
Min length4

Characters and Unicode

Total characters10380
Distinct characters311
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique297 ?
Unique (%)99.3%

Sample

1st rowHOME > 주요사업 > 신 산업입지 조성 공급 > 신규 산업단지 기획 공급
2nd rowHOME > 청렴도 · 부패방지 시책평가 결과
3rd rowHOME > 주요사업 > 입주기업 혁신성장 지원 > 산업집적지경쟁력강화사업 > Cluster
4th rowHOME > 주요사업 > 디지털 · 그린 산업환경 조성 > 입주기업 저탄소 전환 지원
5th rowHOME > 주요사업 > 디지털 · 그린 산업환경 조성 > 입주기업 저탄소 전환 지원
ValueCountFrequency (%)
936
34.3%
home 299
 
11.0%
주요사업 69
 
2.5%
열린경영 62
 
2.3%
정보공개 60
 
2.2%
윤리경영 46
 
1.7%
지원 42
 
1.5%
입주기업 39
 
1.4%
혁신성장 36
 
1.3%
조성 35
 
1.3%
Other values (376) 1105
40.5%
2023-12-13T00:43:42.500564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2430
23.4%
> 935
 
9.0%
O 316
 
3.0%
E 309
 
3.0%
H 300
 
2.9%
M 300
 
2.9%
296
 
2.9%
209
 
2.0%
199
 
1.9%
197
 
1.9%
Other values (301) 4889
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5546
53.4%
Space Separator 2430
23.4%
Uppercase Letter 1321
 
12.7%
Math Symbol 935
 
9.0%
Other Punctuation 48
 
0.5%
Decimal Number 39
 
0.4%
Open Punctuation 22
 
0.2%
Close Punctuation 22
 
0.2%
Lowercase Letter 15
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
296
 
5.3%
209
 
3.8%
199
 
3.6%
197
 
3.6%
173
 
3.1%
159
 
2.9%
153
 
2.8%
145
 
2.6%
144
 
2.6%
116
 
2.1%
Other values (259) 3755
67.7%
Uppercase Letter
ValueCountFrequency (%)
O 316
23.9%
E 309
23.4%
H 300
22.7%
M 300
22.7%
C 23
 
1.7%
I 15
 
1.1%
K 11
 
0.8%
X 10
 
0.8%
S 8
 
0.6%
P 6
 
0.5%
Other values (7) 23
 
1.7%
Decimal Number
ValueCountFrequency (%)
2 11
28.2%
0 9
23.1%
1 8
20.5%
4 4
 
10.3%
6 2
 
5.1%
5 2
 
5.1%
3 2
 
5.1%
7 1
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
20.0%
s 3
20.0%
n 2
13.3%
u 2
13.3%
r 2
13.3%
d 1
 
6.7%
l 1
 
6.7%
t 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
· 20
41.7%
/ 15
31.2%
& 7
 
14.6%
, 6
 
12.5%
Space Separator
ValueCountFrequency (%)
2430
100.0%
Math Symbol
ValueCountFrequency (%)
> 935
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5546
53.4%
Common 3498
33.7%
Latin 1336
 
12.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
296
 
5.3%
209
 
3.8%
199
 
3.6%
197
 
3.6%
173
 
3.1%
159
 
2.9%
153
 
2.8%
145
 
2.6%
144
 
2.6%
116
 
2.1%
Other values (259) 3755
67.7%
Latin
ValueCountFrequency (%)
O 316
23.7%
E 309
23.1%
H 300
22.5%
M 300
22.5%
C 23
 
1.7%
I 15
 
1.1%
K 11
 
0.8%
X 10
 
0.7%
S 8
 
0.6%
P 6
 
0.4%
Other values (15) 38
 
2.8%
Common
ValueCountFrequency (%)
2430
69.5%
> 935
 
26.7%
( 22
 
0.6%
) 22
 
0.6%
· 20
 
0.6%
/ 15
 
0.4%
2 11
 
0.3%
0 9
 
0.3%
1 8
 
0.2%
& 7
 
0.2%
Other values (7) 19
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5546
53.4%
ASCII 4814
46.4%
None 20
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2430
50.5%
> 935
 
19.4%
O 316
 
6.6%
E 309
 
6.4%
H 300
 
6.2%
M 300
 
6.2%
C 23
 
0.5%
( 22
 
0.5%
) 22
 
0.5%
/ 15
 
0.3%
Other values (31) 142
 
2.9%
Hangul
ValueCountFrequency (%)
296
 
5.3%
209
 
3.8%
199
 
3.6%
197
 
3.6%
173
 
3.1%
159
 
2.9%
153
 
2.8%
145
 
2.6%
144
 
2.6%
116
 
2.1%
Other values (259) 3755
67.7%
None
ValueCountFrequency (%)
· 20
100.0%

Correlations

2023-12-13T00:43:42.644523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최상위메뉴부모메뉴
최상위메뉴1.0000.972
부모메뉴0.9721.000

Missing values

2023-12-13T00:43:39.887269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:43:39.968194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

메뉴명최상위메뉴부모메뉴메뉴경로
0신규 산업단지 기획 공급주요사업신 산업입지 조성 공급HOME > 주요사업 > 신 산업입지 조성 공급 > 신규 산업단지 기획 공급
1청렴도 · 부패방지 시책평가 결과청렴도 · 부패방지 시책평가 결과HOMEHOME > 청렴도 · 부패방지 시책평가 결과
2Cluster주요사업산업집적지경쟁력강화사업HOME > 주요사업 > 입주기업 혁신성장 지원 > 산업집적지경쟁력강화사업 > Cluster
3입주기업 저탄소 전환 지원주요사업디지털 · 그린 산업환경 조성HOME > 주요사업 > 디지털 · 그린 산업환경 조성 > 입주기업 저탄소 전환 지원
4입주기업 저탄소 전환 지원주요사업디지털 · 그린 산업환경 조성HOME > 주요사업 > 디지털 · 그린 산업환경 조성 > 입주기업 저탄소 전환 지원
5채용공고고객마당기관채용안내HOME > 고객마당 > 기관채용안내 > 채용공고
6교육사업개요미사용산단스마트화교육사업HOME > 미사용 > 산단스마트화교육사업 > 교육사업개요
7교육과정안내미사용산단스마트화교육사업HOME > 미사용 > 산단스마트화교육사업 > 교육과정안내
8기업혁신 CEO 과정미사용교육과정안내HOME > 미사용 > 산단스마트화교육사업 > 교육과정안내 > 기업혁신 CEO 과정
94차 산업혁명 분야별 교육미사용교육과정안내HOME > 미사용 > 산단스마트화교육사업 > 교육과정안내 > 4차 산업혁명 분야별 교육
메뉴명최상위메뉴부모메뉴메뉴경로
289임직원 해외출장열린경영일반공시HOME > 열린경영 > 일반공시 > 임직원 해외출장
290수의계약 견적제출고객마당수의계약 견적제출HOME > 고객마당 > 수의계약 견적제출 > 수의계약 견적제출
291기업민원 보호·서비스헌장고객마당고객헌장HOME > 고객마당 > 고객헌장 > 기업민원 보호·서비스헌장
292기업애로 해결기업애로 해결HOMEHOME > 기업애로 해결
293기업애로 해결기업애로 해결기업애로 해결HOME > 기업애로 해결 > 기업애로 해결
294규제입증요청고객서비스제안HOME > 고객서비스 > 제안 > 규제입증요청
295기업성장응답센터고객서비스규제애로HOME > 고객서비스 > 규제애로 > 기업성장응답센터
296개인정보처리방침 2021년 6월 30일개인정보처리방침개인정보처리방침HOME > 개인정보처리방침 > 개인정보처리방침 2021년 6월 30일
297공공데이터정보공개공공데이터개방HOME > 정보공개 > 공공데이터개방 > 공공데이터
298청렴윤리경영CP열린경영윤리경영HOME > 열린경영 > 윤리경영 > 청렴윤리경영CP

Duplicate rows

Most frequently occurring

메뉴명최상위메뉴부모메뉴메뉴경로# duplicates
0입주기업 저탄소 전환 지원주요사업디지털 · 그린 산업환경 조성HOME > 주요사업 > 디지털 · 그린 산업환경 조성 > 입주기업 저탄소 전환 지원2