Overview

Dataset statistics

Number of variables4
Number of observations109
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory34.2 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description충청남도 홍성군 지역내 산업단지 입주기업 현황으로 단지명으로 분류를 하여 ,회사명, 업종분류를 하여 업종별로도 사업의 분류를 할수있게 기재되어있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=438&beforeMenuCd=DOM_000000201001001000&publicdatapk=15028983

Alerts

연번 is highly overall correlated with 단지명High correlation
단지명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:44:04.433805
Analysis finished2024-01-09 22:44:04.856318
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct109
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55
Minimum1
Maximum109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-01-10T07:44:04.927639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.4
Q128
median55
Q382
95-th percentile103.6
Maximum109
Range108
Interquartile range (IQR)54

Descriptive statistics

Standard deviation31.609598
Coefficient of variation (CV)0.57471996
Kurtosis-1.2
Mean55
Median Absolute Deviation (MAD)27
Skewness0
Sum5995
Variance999.16667
MonotonicityStrictly increasing
2024-01-10T07:44:05.078198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
70 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
Other values (99) 99
90.8%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
109 1
0.9%
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%

단지명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size1004.0 B
홍성결성전문농공단지
29 
홍성갈산전문농공단지
16 
홍성광천농공단지
15 
홍성구항농공단지
14 
홍성내포도시첨단산업단지
Other values (4)
26 

Length

Max length12
Median length11
Mean length9.5504587
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row홍성일반산업단지
2nd row홍성일반산업단지
3rd row홍성일반산업단지
4th row홍성일반산업단지
5th row홍성일반산업단지

Common Values

ValueCountFrequency (%)
홍성결성전문농공단지 29
26.6%
홍성갈산전문농공단지 16
14.7%
홍성광천농공단지 15
13.8%
홍성구항농공단지 14
12.8%
홍성내포도시첨단산업단지 9
 
8.3%
홍성광천김특화농공단지 9
 
8.3%
홍성은하전문농공단지 8
 
7.3%
홍성일반산업단지 7
 
6.4%
홍성은하농공단지 2
 
1.8%

Length

2024-01-10T07:44:05.217960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:44:05.331523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
홍성결성전문농공단지 29
26.6%
홍성갈산전문농공단지 16
14.7%
홍성광천농공단지 15
13.8%
홍성구항농공단지 14
12.8%
홍성내포도시첨단산업단지 9
 
8.3%
홍성광천김특화농공단지 9
 
8.3%
홍성은하전문농공단지 8
 
7.3%
홍성일반산업단지 7
 
6.4%
홍성은하농공단지 2
 
1.8%
Distinct108
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1004.0 B
2024-01-10T07:44:05.582634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length7.9724771
Min length4

Characters and Unicode

Total characters869
Distinct characters174
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)98.2%

Sample

1st row(주)경남금속
2nd row(주)수천중공업
3rd row(주)우심시스템
4th row성호티에스(주)
5th row일진전기(주)
ValueCountFrequency (%)
주식회사 13
 
9.5%
주)동신포리마 4
 
2.9%
원강금속(주 3
 
2.2%
2공장 3
 
2.2%
제2공장 3
 
2.2%
벽산 2
 
1.5%
㈜한진오토모티브 2
 
1.5%
농업회사법인 2
 
1.5%
참그로 2
 
1.5%
강남이앤알(주 1
 
0.7%
Other values (102) 102
74.5%
2024-01-10T07:44:05.946540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
9.8%
( 71
 
8.2%
) 71
 
8.2%
28
 
3.2%
23
 
2.6%
18
 
2.1%
18
 
2.1%
17
 
2.0%
16
 
1.8%
15
 
1.7%
Other values (164) 507
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 684
78.7%
Open Punctuation 71
 
8.2%
Close Punctuation 71
 
8.2%
Space Separator 28
 
3.2%
Decimal Number 9
 
1.0%
Other Symbol 5
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
12.4%
23
 
3.4%
18
 
2.6%
18
 
2.6%
17
 
2.5%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.9%
Other values (156) 451
65.9%
Decimal Number
ValueCountFrequency (%)
2 7
77.8%
4 1
 
11.1%
3 1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 689
79.3%
Common 180
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
12.3%
23
 
3.3%
18
 
2.6%
18
 
2.6%
17
 
2.5%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.9%
Other values (157) 456
66.2%
Common
ValueCountFrequency (%)
( 71
39.4%
) 71
39.4%
28
 
15.6%
2 7
 
3.9%
4 1
 
0.6%
3 1
 
0.6%
- 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 684
78.7%
ASCII 180
 
20.7%
None 5
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
85
 
12.4%
23
 
3.4%
18
 
2.6%
18
 
2.6%
17
 
2.5%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.9%
Other values (156) 451
65.9%
ASCII
ValueCountFrequency (%)
( 71
39.4%
) 71
39.4%
28
 
15.6%
2 7
 
3.9%
4 1
 
0.6%
3 1
 
0.6%
- 1
 
0.6%
None
ValueCountFrequency (%)
5
100.0%
Distinct73
Distinct (%)67.0%
Missing0
Missing (%)0.0%
Memory size1004.0 B
2024-01-10T07:44:06.233892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length24
Mean length18.834862
Min length7

Characters and Unicode

Total characters2053
Distinct characters168
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)53.2%

Sample

1st row알루미늄주물 주조업
2nd row육상 금속 골조 구조재 제조업 외 1 종
3rd row컴퓨터 프린터 제조업
4th row육상 금속 골조 구조재 제조업
5th row변압기 제조업 외 3 종
ValueCountFrequency (%)
제조업 92
 
13.4%
86
 
12.5%
56
 
8.2%
45
 
6.6%
30
 
4.4%
신품 24
 
3.5%
자동차용 23
 
3.3%
부품 22
 
3.2%
3 18
 
2.6%
기타 13
 
1.9%
Other values (136) 278
40.5%
2024-01-10T07:44:06.625372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
578
28.2%
122
 
5.9%
115
 
5.6%
106
 
5.2%
86
 
4.2%
72
 
3.5%
56
 
2.7%
45
 
2.2%
43
 
2.1%
38
 
1.9%
Other values (158) 792
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1410
68.7%
Space Separator 578
28.2%
Decimal Number 58
 
2.8%
Other Punctuation 7
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
8.7%
115
 
8.2%
106
 
7.5%
86
 
6.1%
72
 
5.1%
56
 
4.0%
45
 
3.2%
43
 
3.0%
38
 
2.7%
35
 
2.5%
Other values (147) 692
49.1%
Decimal Number
ValueCountFrequency (%)
3 19
32.8%
1 14
24.1%
2 7
 
12.1%
4 6
 
10.3%
5 4
 
6.9%
6 3
 
5.2%
7 2
 
3.4%
8 2
 
3.4%
9 1
 
1.7%
Space Separator
ValueCountFrequency (%)
578
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1410
68.7%
Common 643
31.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
8.7%
115
 
8.2%
106
 
7.5%
86
 
6.1%
72
 
5.1%
56
 
4.0%
45
 
3.2%
43
 
3.0%
38
 
2.7%
35
 
2.5%
Other values (147) 692
49.1%
Common
ValueCountFrequency (%)
578
89.9%
3 19
 
3.0%
1 14
 
2.2%
2 7
 
1.1%
, 7
 
1.1%
4 6
 
0.9%
5 4
 
0.6%
6 3
 
0.5%
7 2
 
0.3%
8 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1410
68.7%
ASCII 643
31.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
578
89.9%
3 19
 
3.0%
1 14
 
2.2%
2 7
 
1.1%
, 7
 
1.1%
4 6
 
0.9%
5 4
 
0.6%
6 3
 
0.5%
7 2
 
0.3%
8 2
 
0.3%
Hangul
ValueCountFrequency (%)
122
 
8.7%
115
 
8.2%
106
 
7.5%
86
 
6.1%
72
 
5.1%
56
 
4.0%
45
 
3.2%
43
 
3.0%
38
 
2.7%
35
 
2.5%
Other values (147) 692
49.1%

Interactions

2024-01-10T07:44:04.659464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:44:06.709940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번단지명업종명
연번1.0000.9230.964
단지명0.9231.0000.987
업종명0.9640.9871.000
2024-01-10T07:44:06.784995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번단지명
연번1.0000.751
단지명0.7511.000

Missing values

2024-01-10T07:44:04.753927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:44:04.825211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번단지명회사명업종명
01홍성일반산업단지(주)경남금속알루미늄주물 주조업
12홍성일반산업단지(주)수천중공업육상 금속 골조 구조재 제조업 외 1 종
23홍성일반산업단지(주)우심시스템컴퓨터 프린터 제조업
34홍성일반산업단지성호티에스(주)육상 금속 골조 구조재 제조업
45홍성일반산업단지일진전기(주)변압기 제조업 외 3 종
56홍성일반산업단지주식회사 벽산폴리스티렌 발포 성형제품 제조업
67홍성일반산업단지주식회사 벽산 홍성 제2공장1차 유리제품, 유리섬유 및 광학용 유리 제조업
78홍성내포도시첨단산업단지(주)동양테크윈유선 통신장비 제조업 외 6 종
89홍성내포도시첨단산업단지(주)월산이앤씨배전반 및 전기 자동제어반 제조업 외 5 종
910홍성내포도시첨단산업단지(주)유니에어공조산업용 냉장 및 냉동 장비 제조업 외 3 종
연번단지명회사명업종명
99100홍성은하전문농공단지중앙식품수산식물 가공 및 저장 처리업
100101홍성광천김특화농공단지(주)김노리수산식물 가공 및 저장 처리업
101102홍성광천김특화농공단지(주)솔뫼에프엔씨수산식물 가공 및 저장 처리업
102103홍성광천김특화농공단지(주)해저식품수산식물 가공 및 저장 처리업
103104홍성광천김특화농공단지광천농업협동조합수산식물 가공 및 저장 처리업
104105홍성광천김특화농공단지광천조양식품수산식물 가공 및 저장 처리업
105106홍성광천김특화농공단지서해수산푸드(주)수산동물 건조 및 염장품 제조업 외 1 종
106107홍성광천김특화농공단지영어조합법인 최강식품수산식물 가공 및 저장 처리업
107108홍성광천김특화농공단지주식회사 해저김수산식물 가공 및 저장 처리업
108109홍성광천김특화농공단지천일식품(주)면류, 마카로니 및 유사식품 제조업