Overview

Dataset statistics

Number of variables: 6
Number of observations: 39
Missing cells: 59
Missing cells (%): 25.2%
Duplicate rows: 1
Duplicate rows (%): 2.6%
Total size in memory: 2.0 KiB
Average record size in memory: 52.4 B
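
The percentages above follow directly from the raw counts. A minimal sketch of the arithmetic, using only figures quoted in this report (the variable names are illustrative, and the per-column counts are taken from the Alerts and Variables sections):

```python
# Counts quoted in the dataset statistics.
n_variables, n_observations = 6, 39
missing_cells, duplicate_rows = 59, 1

# Per-column missing counts reported for the six variables:
# 산단명, 입주기업명, 대표자, 주소, 연락처, Unnamed: 5
missing_per_column = [0, 5, 5, 5, 5, 39]

total_cells = n_variables * n_observations            # 6 * 39 = 234
missing_pct = 100 * missing_cells / total_cells       # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations # share of duplicate rows

print(f"missing: {missing_pct:.1f}%, duplicates: {duplicate_pct:.1f}%")
```

Almost two thirds of the missing cells (39 of 59) come from the single all-empty column `Unnamed: 5`.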

Variable types

Categorical: 1
Text: 4
Unsupported: 1

Dataset

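Description: Provides public data on the status of tenant companies in the industrial complexes within Gyeryong City (industrial complex name, tenant company name, representative, address, contact number).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=204&beforeMenuCd=DOM_000000201001001000&publicdatapk=15096112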

Alerts

Dataset has 1 (2.6%) duplicate rows [Duplicates]
입주기업명 has 5 (12.8%) missing values [Missing]
대표자 has 5 (12.8%) missing values [Missing]
주소 has 5 (12.8%) missing values [Missing]
연락처 has 5 (12.8%) missing values [Missing]
Unnamed: 5 has 39 (100.0%) missing values [Missing]
Unnamed: 5 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
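
These alerts point to two cleaning steps: dropping rows that are empty in every column and dropping the column that holds no values. A minimal sketch in plain Python, assuming the data is held as a list of dictionaries with `None` for missing values (the `rows` sample and the `drop_empty` helper are hypothetical, not part of the report):

```python
# Hypothetical rows mirroring some of the report's columns; None marks a missing value.
rows = [
    {"산단명": "계룡제1산업단지", "입주기업명": "㈜아워홈", "Unnamed: 5": None},
    {"산단명": "계룡제2산업단지", "입주기업명": "농경마을", "Unnamed: 5": None},
    {"산단명": None, "입주기업명": None, "Unnamed: 5": None},
]


def drop_empty(records):
    """Remove rows, then columns, that contain no values at all."""
    kept_rows = [r for r in records if any(v is not None for v in r.values())]
    if not kept_rows:
        return []
    kept_cols = [c for c in kept_rows[0] if any(r[c] is not None for r in kept_rows)]
    return [{c: r[c] for c in kept_cols} for r in kept_rows]


cleaned = drop_empty(rows)
print(cleaned)
```

Rows are filtered before columns so that a column is judged only on the rows that survive.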

Reproduction

Analysis started: 2024-01-09 20:26:51.490882
Analysis finished: 2024-01-09 20:26:51.971775
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
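
A report of this kind is typically produced by passing a pandas DataFrame to `ProfileReport`. The sketch below is an assumption about how that might look, not the configuration actually used here: the function name, file paths, report title, and default encoding are all hypothetical, and the source file's real encoding is not stated in the report.

```python
def build_profile(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write the HTML report (illustrative sketch)."""
    # Imported lazily so this module can be loaded without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the source file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Tenant companies profile (hypothetical title)")
    report.to_file(html_path)
    return report
```

A call such as `build_profile("tenants.csv", "report.html")` would then write the report, with both file names being placeholders.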

Variables

산단명
Categorical

Distinct: 3
Distinct (%): 7.7%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

계룡제1산업단지: 19
계룡제2산업단지: 15
<NA>: 5

Length

Max length: 8
Median length: 8
Mean length: 7.4871795
Min length: 4
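
These length figures are consistent with the five `<NA>` entries being measured as 4-character strings alongside 34 eight-character complex names. That reading is an inference from the reported numbers rather than something the report states; a short check of the arithmetic:

```python
from statistics import mean, median

# 19 + 15 rows carry an 8-character complex name; 5 rows carry the literal "<NA>".
lengths = [len("계룡제1산업단지")] * (19 + 15) + [len("<NA>")] * 5

mean_length = mean(lengths)      # (34 * 8 + 5 * 4) / 39
median_length = median(lengths)

print(min(lengths), max(lengths), median_length, round(mean_length, 7))
```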

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 계룡제1산업단지
2nd row: 계룡제1산업단지
3rd row: 계룡제1산업단지
4th row: 계룡제1산업단지
5th row: 계룡제1산업단지

Common Values

Value | Count | Frequency (%)
계룡제1산업단지 | 19 | 48.7%
계룡제2산업단지 | 15 | 38.5%
<NA> | 5 | 12.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
계룡제1산업단지 | 19 | 48.7%
계룡제2산업단지 | 15 | 38.5%
<NA> | 5 | 12.8%

입주기업명
Text

MISSING 

Distinct: 34
Distinct (%): 100.0%
Missing: 5
Missing (%): 12.8%
Memory size: 444.0 B

Length

Max length: 14
Median length: 8
Mean length: 5.8235294
Min length: 3

Characters and Unicode

Total characters: 198
Distinct characters: 102
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
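
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the five sample company names listed in this section; `Lo` corresponds to "Other Letter" and `So` to "Other Symbol".

```python
import unicodedata
from collections import Counter

# The five sample values shown for 입주기업명.
names = ["㈜아워홈", "소이미푸드㈜", "농업회사법인㈜퍼스프", "푸드텍농업회사법인㈜", "㈜비타바이오"]

# Count the Unicode general category of every character across all names.
category_counts = Counter(unicodedata.category(ch) for name in names for ch in name)

print(category_counts)
```

The corporate marker ㈜ is the only symbol in these names; every other character is a Hangul letter.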

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: ㈜아워홈
2nd row: 소이미푸드㈜
3rd row: 농업회사법인㈜퍼스프
4th row: 푸드텍농업회사법인㈜
5th row: ㈜비타바이오

Value | Count | Frequency (%)
㈜굿스굿 | 1 | 2.9%
한국타이어㈜ | 1 | 2.9%
팔천식품 | 1 | 2.9%
㈜훼미리푸드 | 1 | 2.9%
㈜우리상사 | 1 | 2.9%
명랑시대외식청년창업협동조합 | 1 | 2.9%
㈜티에스씨 | 1 | 2.9%
㈜휴마스 | 1 | 2.9%
㈜황산벌육가공 | 1 | 2.9%
농업회사법인㈜퍼스프 | 1 | 2.9%
Other values (24) | 24 | 70.6%

Most occurring characters

Value | Count | Frequency (%)
[?] | 29 | 14.6%
[?] | 10 | 5.1%
[?] | 9 | 4.5%
[?] | 5 | 2.5%
[?] | 4 | 2.0%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
Other values (92) | 126 | 63.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 169 | 85.4%
Other Symbol | 29 | 14.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 10 | 5.9%
[?] | 9 | 5.3%
[?] | 5 | 3.0%
[?] | 4 | 2.4%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
Other values (91) | 123 | 72.8%
Other Symbol
Value | Count | Frequency (%)
[?] | 29 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 198 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 29 | 14.6%
[?] | 10 | 5.1%
[?] | 9 | 4.5%
[?] | 5 | 2.5%
[?] | 4 | 2.0%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
[?] | 3 | 1.5%
Other values (92) | 126 | 63.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 169 | 85.4%
None | 29 | 14.6%

Most frequent character per block

None
Value | Count | Frequency (%)
[?] | 29 | 100.0%
Hangul
Value | Count | Frequency (%)
[?] | 10 | 5.9%
[?] | 9 | 5.3%
[?] | 5 | 3.0%
[?] | 4 | 2.4%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
[?] | 3 | 1.8%
Other values (91) | 123 | 72.8%

대표자
Text

MISSING 

Distinct: 34
Distinct (%): 100.0%
Missing: 5
Missing (%): 12.8%
Memory size: 444.0 B

Length

Max length: 8
Median length: 3
Mean length: 3.1470588
Min length: 3

Characters and Unicode

Total characters: 107
Distinct characters: 64
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 구지은
2nd row: 신희수
3rd row: 이충관
4th row: 한동훈
5th row: 유기종

Value | Count | Frequency (%)
박원규 | 1 | 2.9%
이도재 | 1 | 2.9%
정철재 | 1 | 2.9%
구기운 | 1 | 2.9%
이종형 | 1 | 2.9%
안교덕 | 1 | 2.9%
전영관 | 1 | 2.9%
이수일 | 1 | 2.9%
김겸석 | 1 | 2.9%
김상우 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Most occurring characters

Value | Count | Frequency (%)
[?] | 5 | 4.7%
[?] | 5 | 4.7%
[?] | 4 | 3.7%
[?] | 4 | 3.7%
[?] | 3 | 2.8%
[?] | 3 | 2.8%
[?] | 3 | 2.8%
[?] | 3 | 2.8%
[?] | 3 | 2.8%
[?] | 3 | 2.8%
Other values (54) | 71 | 66.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 105 | 98.1%
Space Separator | 1 | 0.9%
Other Punctuation | 1 | 0.9%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 5 | 4.8%
[?] | 5 | 4.8%
[?] | 4 | 3.8%
[?] | 4 | 3.8%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
Other values (52) | 69 | 65.7%
Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 105 | 98.1%
Common | 2 | 1.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 5 | 4.8%
[?] | 5 | 4.8%
[?] | 4 | 3.8%
[?] | 4 | 3.8%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
Other values (52) | 69 | 65.7%
Common
Value | Count | Frequency (%)
(space) | 1 | 50.0%
, | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 105 | 98.1%
ASCII | 2 | 1.9%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[?] | 5 | 4.8%
[?] | 5 | 4.8%
[?] | 4 | 3.8%
[?] | 4 | 3.8%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
[?] | 3 | 2.9%
Other values (52) | 69 | 65.7%
ASCII
Value | Count | Frequency (%)
(space) | 1 | 50.0%
, | 1 | 50.0%

주소
Text

MISSING 

Distinct: 34
Distinct (%): 100.0%
Missing: 5
Missing (%): 12.8%
Memory size: 444.0 B

Length

Max length: 21
Median length: 19
Mean length: 17.352941
Min length: 14

Characters and Unicode

Total characters: 590
Distinct characters: 26
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 계룡시 두마면 제1산단로 26-21
2nd row: 계룡시 두마면 제1산단로 16-9
3rd row: 계룡시 두마면 제1산단로 26-31
4th row: 계룡시 두마면 제1산단로 40-37
5th row: 계룡시 두마면 제1산단로 40-33

Value | Count | Frequency (%)
계룡시 | 34 | 24.8%
두마면 | 34 | 24.8%
제1산단로 | 19 | 13.9%
입암길 | 15 | 10.9%
76-42 | 1 | 0.7%
42-45 | 1 | 0.7%
72 | 1 | 0.7%
36 | 1 | 0.7%
76-18 | 1 | 0.7%
78 | 1 | 0.7%
Other values (29) | 29 | 21.2%

Most occurring characters

Value | Count | Frequency (%)
(space) | 104 | 17.6%
[?] | 34 | 5.8%
[?] | 34 | 5.8%
[?] | 34 | 5.8%
[?] | 34 | 5.8%
[?] | 34 | 5.8%
[?] | 34 | 5.8%
1 | 33 | 5.6%
2 | 26 | 4.4%
- | 25 | 4.2%
Other values (16) | 198 | 33.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 325 | 55.1%
Decimal Number | 135 | 22.9%
Space Separator | 104 | 17.6%
Dash Punctuation | 25 | 4.2%
Other Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
Other values (3) | 45 | 13.8%
Decimal Number
Value | Count | Frequency (%)
1 | 33 | 24.4%
2 | 26 | 19.3%
4 | 16 | 11.9%
3 | 13 | 9.6%
6 | 13 | 9.6%
7 | 11 | 8.1%
5 | 7 | 5.2%
0 | 6 | 4.4%
8 | 5 | 3.7%
9 | 5 | 3.7%
Space Separator
Value | Count | Frequency (%)
(space) | 104 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 25 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 325 | 55.1%
Common | 265 | 44.9%

Most frequent character per script

Common
Value | Count | Frequency (%)
(space) | 104 | 39.2%
1 | 33 | 12.5%
2 | 26 | 9.8%
- | 25 | 9.4%
4 | 16 | 6.0%
3 | 13 | 4.9%
6 | 13 | 4.9%
7 | 11 | 4.2%
5 | 7 | 2.6%
0 | 6 | 2.3%
Other values (3) | 11 | 4.2%
Hangul
Value | Count | Frequency (%)
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
Other values (3) | 45 | 13.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 325 | 55.1%
ASCII | 265 | 44.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 104 | 39.2%
1 | 33 | 12.5%
2 | 26 | 9.8%
- | 25 | 9.4%
4 | 16 | 6.0%
3 | 13 | 4.9%
6 | 13 | 4.9%
7 | 11 | 4.2%
5 | 7 | 2.6%
0 | 6 | 2.3%
Other values (3) | 11 | 4.2%
Hangul
Value | Count | Frequency (%)
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 34 | 10.5%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
[?] | 19 | 5.8%
Other values (3) | 45 | 13.8%

연락처
Text

MISSING 

Distinct: 34
Distinct (%): 100.0%
Missing: 5
Missing (%): 12.8%
Memory size: 444.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.176471
Min length: 12

Characters and Unicode

Total characters: 414
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 042-719-9200
2nd row: 042-841-2357
3rd row: 042-222-0021
4th row: 070-7773-2346
5th row: 042-627-4647

Value | Count | Frequency (%)
042-628-6826 | 1 | 2.9%
042-634-9734 | 1 | 2.9%
042-631-8801 | 1 | 2.9%
042-545-3007 | 1 | 2.9%
042-824-1614 | 1 | 2.9%
042-482-0399 | 1 | 2.9%
042-841-5080 | 1 | 2.9%
042-864-2462 | 1 | 2.9%
042-542-5815 | 1 | 2.9%
042-222-0021 | 1 | 2.9%
Other values (24) | 24 | 70.6%

Most occurring characters

Value | Count | Frequency (%)
0 | 70 | 16.9%
- | 68 | 16.4%
2 | 60 | 14.5%
4 | 53 | 12.8%
3 | 29 | 7.0%
1 | 26 | 6.3%
8 | 25 | 6.0%
6 | 24 | 5.8%
5 | 22 | 5.3%
7 | 21 | 5.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 346 | 83.6%
Dash Punctuation | 68 | 16.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 70 | 20.2%
2 | 60 | 17.3%
4 | 53 | 15.3%
3 | 29 | 8.4%
1 | 26 | 7.5%
8 | 25 | 7.2%
6 | 24 | 6.9%
5 | 22 | 6.4%
7 | 21 | 6.1%
9 | 16 | 4.6%
Dash Punctuation
Value | Count | Frequency (%)
- | 68 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 414 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 70 | 16.9%
- | 68 | 16.4%
2 | 60 | 14.5%
4 | 53 | 12.8%
3 | 29 | 7.0%
1 | 26 | 6.3%
8 | 25 | 6.0%
6 | 24 | 5.8%
5 | 22 | 5.3%
7 | 21 | 5.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 414 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 70 | 16.9%
- | 68 | 16.4%
2 | 60 | 14.5%
4 | 53 | 12.8%
3 | 29 | 7.0%
1 | 26 | 6.3%
8 | 25 | 6.0%
6 | 24 | 5.8%
5 | 22 | 5.3%
7 | 21 | 5.1%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 39
Missing (%): 100.0%
Memory size: 483.0 B

Correlations

Variable | 산단명 | 입주기업명 | 대표자 | 주소 | 연락처
산단명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
입주기업명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대표자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
연락처 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
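
Nullity correlation can be illustrated as a Pearson correlation between per-column missingness indicators. The sketch below assumes the pattern visible in this report, where two columns are filled for the first 34 rows and empty for the last 5; the `pearson` helper and the indicator lists are illustrative and are not the report's own implementation.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # A column that is always present or always missing has zero variance,
    # so the correlation is undefined for it (division by zero here).
    return cov / sqrt(var_x * var_y)


# 1 = value missing, 0 = value present; 34 filled rows followed by 5 empty rows.
company_missing = [0] * 34 + [1] * 5
address_missing = [0] * 34 + [1] * 5

nullity_corr = pearson(company_missing, address_missing)
print(round(nullity_corr, 3))
```

Because both columns are missing in exactly the same rows, their nullity correlation is 1.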

Sample

Index | 산단명 | 입주기업명 | 대표자 | 주소 | 연락처 | Unnamed: 5
0 | 계룡제1산업단지 | ㈜아워홈 | 구지은 | 계룡시 두마면 제1산단로 26-21 | 042-719-9200 | <NA>
1 | 계룡제1산업단지 | 소이미푸드㈜ | 신희수 | 계룡시 두마면 제1산단로 16-9 | 042-841-2357 | <NA>
2 | 계룡제1산업단지 | 농업회사법인㈜퍼스프 | 이충관 | 계룡시 두마면 제1산단로 26-31 | 042-222-0021 | <NA>
3 | 계룡제1산업단지 | 푸드텍농업회사법인㈜ | 한동훈 | 계룡시 두마면 제1산단로 40-37 | 070-7773-2346 | <NA>
4 | 계룡제1산업단지 | ㈜비타바이오 | 유기종 | 계룡시 두마면 제1산단로 40-33 | 042-627-4647 | <NA>
5 | 계룡제1산업단지 | ㈜해련식품 | 박지혜 | 계룡시 두마면 제1산단로 40-29 | 042-583-2145 | <NA>
6 | 계룡제1산업단지 | ㈜내담에프앤비 | 최동재 | 계룡시 두마면 제1산단로 40-21 | 070-8223-2374 | <NA>
7 | 계룡제1산업단지 | ㈜마메든도어 | 장금숙 | 계룡시 두마면 제1산단로 40-7 | 042-542-4007 | <NA>
8 | 계룡제1산업단지 | ㈜계룡글라스텍 | 장광호 | 계룡시 두마면 제1산단로 30 | 042-825-9390 | <NA>
9 | 계룡제1산업단지 | ㈜굿스굿 | 박원규 | 계룡시 두마면 제1산단로 38 | 042-628-6826 | <NA>

Index | 산단명 | 입주기업명 | 대표자 | 주소 | 연락처 | Unnamed: 5
29 | 계룡제2산업단지 | 농경마을 | 김광철 | 계룡시 두마면 입암길 42-29 | 042-489-9937 | <NA>
30 | 계룡제2산업단지 | ㈜코아팀즈 | 최용기 | 계룡시 두마면 입암길 42-11 | 042-825-5304 | <NA>
31 | 계룡제2산업단지 | ㈜코렌스알티엑스 | 천세욱, 조형근 | 계룡시 두마면 입암길 76-32 | 042-863-9380 | <NA>
32 | 계룡제2산업단지 | ㈜와이투아이 | 양태현 | 계룡시 두마면 입암길 76-42 | 042-633-1057 | <NA>
33 | 계룡제2산업단지 | ㈜누에보컴퍼니 | 황진우 | 계룡시 두마면 입암길 72 | 042-621-4260 | <NA>
34 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
35 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
36 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
37 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
38 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Duplicate rows

Most frequently occurring

Index | 산단명 | 입주기업명 | 대표자 | 주소 | 연락처 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 5
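
The duplicate group above can be reproduced by counting identical rows. A minimal sketch, assuming rows are held as tuples with `None` for missing values; the sample list contains one real row from this report plus the five empty rows, and the variable names are illustrative.

```python
from collections import Counter

# One filled row from the sample, followed by the five all-empty trailing rows.
rows = [
    ("계룡제2산업단지", "㈜누에보컴퍼니", "황진우", "계룡시 두마면 입암길 72", "042-621-4260"),
] + [(None, None, None, None, None)] * 5

# Count each distinct row, then keep only those that occur more than once.
row_counts = Counter(rows)
duplicates = {row: count for row, count in row_counts.items() if count > 1}

print(duplicates)
```

Only the all-empty row repeats, so removing empty rows also clears the duplicate alert.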