Overview

Dataset statistics

Number of variables: 6
Number of observations: 167
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 49.8 B

Variable types

Numeric: 1
Categorical: 2
Text: 3

Dataset

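Description: Data on the companies located in the industrial complexes of Yesan-gun, Chungcheongnam-do. It provides the industrial complex name, company name, address, and telephone number.
Author: Yesan-gun, Chungcheongnam-do (충청남도 예산군)
URL: https://www.data.go.kr/data/15095873/fileData.do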

Alerts

데이터기준일자 (data reference date) has constant value "2023-09-15" [Constant]
연번 (serial number) is highly overall correlated with 산업단지명 (industrial complex name) [High correlation]
산업단지명 is highly overall correlated with 연번 [High correlation]
연번 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 08:18:08.129664
Analysis finished: 2023-12-12 08:18:08.639164
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 167
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 84
Minimum: 1
Maximum: 167
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 1
5th percentile: 9.3
Q1: 42.5
Median: 84
Q3: 125.5
95th percentile: 158.7
Maximum: 167
Range: 166
Interquartile range (IQR): 83

Descriptive statistics

Standard deviation: 48.35287
Coefficient of variation (CV): 0.5756294
Kurtosis: -1.2
Mean: 84
Median Absolute Deviation (MAD): 42
Skewness: 0
Sum: 14028
Variance: 2338
Monotonicity: Strictly increasing
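
Since 연번 is strictly increasing with 167 distinct values between 1 and 167, the figures above are consistent with the plain integer sequence 1..167, and most of them can be re-derived by hand. The following is a minimal sketch using only the Python standard library. The `percentile` helper is a hypothetical name, and linear interpolation between order statistics is an assumption that happens to reproduce the reported quantiles; it is not confirmed as the profiling tool's own method.

```python
import math
import statistics

# Assumption: 연번 is exactly the integer sequence 1..167.
values = list(range(1, 168))


def percentile(sorted_values, q):
    """Hypothetical helper: q-th percentile by linear interpolation."""
    position = (len(sorted_values) - 1) * q / 100
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print("sum:", total, "mean:", mean, "variance:", variance)
print("std:", round(std_dev, 5), "cv:", round(cv, 7), "mad:", mad)
print("Q1:", percentile(values, 25), "Q3:", percentile(values, 75))
```

For a sequence 1..n the sample variance has the closed form n(n + 1) / 12, which for n = 167 gives 2338 and matches the reported value.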
[Figure: histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
1 | 1 | 0.6%
116 | 1 | 0.6%
108 | 1 | 0.6%
109 | 1 | 0.6%
110 | 1 | 0.6%
111 | 1 | 0.6%
112 | 1 | 0.6%
113 | 1 | 0.6%
114 | 1 | 0.6%
115 | 1 | 0.6%
Other values (157) | 157 | 94.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
167 | 1 | 0.6%
166 | 1 | 0.6%
165 | 1 | 0.6%
164 | 1 | 0.6%
163 | 1 | 0.6%
162 | 1 | 0.6%
161 | 1 | 0.6%
160 | 1 | 0.6%
159 | 1 | 0.6%
158 | 1 | 0.6%

산업단지명
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
예산예당일반산업단지: 72
예산일반산업단지: 70
예산신소재일반산업단지: 25

Length

Max length: 11
Median length: 10
Mean length: 9.3113772
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 예산일반산업단지
2nd row: 예산일반산업단지
3rd row: 예산일반산업단지
4th row: 예산일반산업단지
5th row: 예산일반산업단지

Common Values

Value | Count | Frequency (%)
예산예당일반산업단지 | 72 | 43.1%
예산일반산업단지 | 70 | 41.9%
예산신소재일반산업단지 | 25 | 15.0%
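
The frequency percentages and the length statistics for 산업단지명 follow directly from the three counts in the Common Values table above. A minimal sketch that recomputes them with the Python standard library, assuming lengths are counted in Unicode code points with the names in precomposed (NFC) form:

```python
import statistics

# Counts copied from the Common Values table for 산업단지명.
counts = {
    "예산예당일반산업단지": 72,
    "예산일반산업단지": 70,
    "예산신소재일반산업단지": 25,
}
total = sum(counts.values())

# Frequency (%) is each count divided by the number of observations.
frequencies = {name: round(100 * n / total, 1) for name, n in counts.items()}

# Expand to one length per row to reproduce the Length statistics.
lengths = [len(name) for name, n in counts.items() for _ in range(n)]
mean_length = statistics.mean(lengths)
median_length = statistics.median(lengths)

print(total, frequencies)
print(round(mean_length, 7), median_length, min(lengths), max(lengths))
```

The three counts sum to 167, the number of observations, which is consistent with the column having no missing values.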

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

입주업체명
Text

Distinct: 158
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 18
Median length: 15
Mean length: 8.1736527
Min length: 3

Characters and Unicode

Total characters: 1365
Distinct characters: 209
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 150
Unique (%): 89.8%
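
The report separates distinct values (how many different values occur, 158 here) from unique values (how many values occur exactly once, 150 here), so eight company names appear more than once. A minimal sketch of the difference, using the first five company names that appear in this report's sample rows:

```python
from collections import Counter

# First five 입주업체명 values, copied from the sample rows of this report.
names = [
    "(주)경인양행",
    "(주)나이스엘엠에스 주조공장",
    "(주)남영테크",
    "(주)남영테크",
    "(주)네오오토 예산3공장",
]

occurrences = Counter(names)
distinct = len(occurrences)                              # number of different values
unique = sum(1 for n in occurrences.values() if n == 1)  # values seen exactly once

print(distinct, unique)
```

In this small sample (주)남영테크 occurs twice, so it counts toward the distinct total but not toward the unique total.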

Sample

1st row: (주)경인양행
2nd row: (주)나이스엘엠에스 주조공장
3rd row: (주)남영테크
4th row: (주)남영테크
5th row: (주)네오오토 예산3공장

Words

Value | Count | Frequency (%)
주식회사 | 4 | 2.2%
주)신호인더스트리 | 3 | 1.6%
주)네오오토 | 2 | 1.1%
주)명배메탈 | 2 | 1.1%
에이치피코리아(주 | 2 | 1.1%
예당공장 | 2 | 1.1%
이엔지스틸(주 | 2 | 1.1%
예산공장 | 2 | 1.1%
유한책임회사 | 2 | 1.1%
주)영강 | 2 | 1.1%
Other values (157) | 162 | 87.6%

Most occurring characters

Value | Count | Frequency (%)
[?] | 151 | 11.1%
( | 141 | 10.3%
) | 141 | 10.3%
[?] | 48 | 3.5%
[?] | 47 | 3.4%
[?] | 46 | 3.4%
[?] | 20 | 1.5%
[?] | 20 | 1.5%
[?] | 19 | 1.4%
[?] | 19 | 1.4%
Other values (199) | 713 | 52.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1055 | 77.3%
Open Punctuation | 141 | 10.3%
Close Punctuation | 141 | 10.3%
Space Separator | 19 | 1.4%
Decimal Number | 7 | 0.5%
Other Symbol | 2 | 0.1%
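
The category names in this table are Unicode general categories. A minimal sketch of how such counts can be produced with Python's standard `unicodedata` module, applied to one company name from this report. The `category_counts` helper and the `CATEGORY_NAMES` mapping are hypothetical names for illustration, and whether the profiling tool computes its counts this way is an assumption:

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "So": "Other Symbol",
}


def category_counts(text):
    """Hypothetical helper: count characters per Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts = category_counts("(주)네오오토 예산3공장")
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the company-name column.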

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
[?] | 151 | 14.3%
[?] | 48 | 4.5%
[?] | 47 | 4.5%
[?] | 46 | 4.4%
[?] | 20 | 1.9%
[?] | 20 | 1.9%
[?] | 19 | 1.8%
[?] | 18 | 1.7%
[?] | 17 | 1.6%
[?] | 16 | 1.5%
Other values (192) | 653 | 61.9%

Decimal Number

Value | Count | Frequency (%)
2 | 4 | 57.1%
1 | 2 | 28.6%
3 | 1 | 14.3%

Open Punctuation

Value | Count | Frequency (%)
( | 141 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 141 | 100.0%

Space Separator

Value | Count | Frequency (%)
(space) | 19 | 100.0%

Other Symbol

Value | Count | Frequency (%)
[?] | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1057 | 77.4%
Common | 308 | 22.6%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
[?] | 151 | 14.3%
[?] | 48 | 4.5%
[?] | 47 | 4.4%
[?] | 46 | 4.4%
[?] | 20 | 1.9%
[?] | 20 | 1.9%
[?] | 19 | 1.8%
[?] | 18 | 1.7%
[?] | 17 | 1.6%
[?] | 16 | 1.5%
Other values (193) | 655 | 62.0%

Common

Value | Count | Frequency (%)
( | 141 | 45.8%
) | 141 | 45.8%
(space) | 19 | 6.2%
2 | 4 | 1.3%
1 | 2 | 0.6%
3 | 1 | 0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1055 | 77.3%
ASCII | 308 | 22.6%
None | 2 | 0.1%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
[?] | 151 | 14.3%
[?] | 48 | 4.5%
[?] | 47 | 4.5%
[?] | 46 | 4.4%
[?] | 20 | 1.9%
[?] | 20 | 1.9%
[?] | 19 | 1.8%
[?] | 18 | 1.7%
[?] | 17 | 1.6%
[?] | 16 | 1.5%
Other values (192) | 653 | 61.9%

ASCII

Value | Count | Frequency (%)
( | 141 | 45.8%
) | 141 | 45.8%
(space) | 19 | 6.2%
2 | 4 | 1.3%
1 | 2 | 0.6%
3 | 1 | 0.3%

None

Value | Count | Frequency (%)
[?] | 2 | 100.0%

주소
Text

Distinct: 157
Distinct (%): 94.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 49
Median length: 39
Mean length: 26.976048
Min length: 19

Characters and Unicode

Total characters: 4505
Distinct characters: 79
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 147
Unique (%): 88.0%

Sample

1st row: 충청남도 예산군 응봉면 산단1길 7
2nd row: 충청남도 예산군 삽교읍 산단1길 140, (예산일반산업단지) (삽교읍)
3rd row: 충청남도 예산군 삽교읍 산단3길 16, (예산일반산업단지)
4th row: 충청남도 예산군 삽교읍 효림리 519-2
5th row: 충청남도 예산군 삽교읍 산단2길 85

Words

Value | Count | Frequency (%)
충청남도 | 167 | 17.3%
예산군 | 167 | 17.3%
고덕면 | 97 | 10.0%
삽교읍 | 63 | 6.5%
산단3길 | 24 | 2.5%
오추리 | 19 | 2.0%
[?] | 18 | 1.9%
산단2길 | 17 | 1.8%
산단1길 | 17 | 1.8%
예당산단4길 | 14 | 1.4%
Other values (175) | 363 | 37.6%

Most occurring characters

Value | Count | Frequency (%)
(space) | 799 | 17.7%
[?] | 356 | 7.9%
[?] | 255 | 5.7%
[?] | 171 | 3.8%
[?] | 171 | 3.8%
[?] | 167 | 3.7%
[?] | 167 | 3.7%
[?] | 167 | 3.7%
[?] | 162 | 3.6%
1 | 133 | 3.0%
Other values (69) | 1957 | 43.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2909 | 64.6%
Space Separator | 799 | 17.7%
Decimal Number | 665 | 14.8%
Other Punctuation | 58 | 1.3%
Open Punctuation | 24 | 0.5%
Close Punctuation | 24 | 0.5%
Dash Punctuation | 22 | 0.5%
Uppercase Letter | 4 | 0.1%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
[?] | 356 | 12.2%
[?] | 255 | 8.8%
[?] | 171 | 5.9%
[?] | 171 | 5.9%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 162 | 5.6%
[?] | 114 | 3.9%
[?] | 106 | 3.6%
Other values (52) | 1073 | 36.9%

Decimal Number

Value | Count | Frequency (%)
1 | 133 | 20.0%
2 | 116 | 17.4%
3 | 84 | 12.6%
0 | 65 | 9.8%
4 | 55 | 8.3%
6 | 55 | 8.3%
7 | 41 | 6.2%
5 | 40 | 6.0%
9 | 39 | 5.9%
8 | 37 | 5.6%

Uppercase Letter

Value | Count | Frequency (%)
B | 2 | 50.0%
L | 2 | 50.0%

Space Separator

Value | Count | Frequency (%)
(space) | 799 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
, | 58 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 24 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 24 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 22 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2909 | 64.6%
Common | 1592 | 35.3%
Latin | 4 | 0.1%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
[?] | 356 | 12.2%
[?] | 255 | 8.8%
[?] | 171 | 5.9%
[?] | 171 | 5.9%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 162 | 5.6%
[?] | 114 | 3.9%
[?] | 106 | 3.6%
Other values (52) | 1073 | 36.9%

Common

Value | Count | Frequency (%)
(space) | 799 | 50.2%
1 | 133 | 8.4%
2 | 116 | 7.3%
3 | 84 | 5.3%
0 | 65 | 4.1%
, | 58 | 3.6%
4 | 55 | 3.5%
6 | 55 | 3.5%
7 | 41 | 2.6%
5 | 40 | 2.5%
Other values (5) | 146 | 9.2%

Latin

Value | Count | Frequency (%)
B | 2 | 50.0%
L | 2 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2909 | 64.6%
ASCII | 1596 | 35.4%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
(space) | 799 | 50.1%
1 | 133 | 8.3%
2 | 116 | 7.3%
3 | 84 | 5.3%
0 | 65 | 4.1%
, | 58 | 3.6%
4 | 55 | 3.4%
6 | 55 | 3.4%
7 | 41 | 2.6%
5 | 40 | 2.5%
Other values (7) | 150 | 9.4%

Hangul

Value | Count | Frequency (%)
[?] | 356 | 12.2%
[?] | 255 | 8.8%
[?] | 171 | 5.9%
[?] | 171 | 5.9%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 167 | 5.7%
[?] | 162 | 5.6%
[?] | 114 | 3.9%
[?] | 106 | 3.6%
Other values (52) | 1073 | 36.9%

전화번호
Text

Distinct: 151
Distinct (%): 90.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.113772
Min length: 11

Characters and Unicode

Total characters: 2023
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 140
Unique (%): 83.8%

Sample

1st row: 02-3660-7821
2nd row: 041-331-6390
3rd row: 041-338-9691
4th row: 041-338-9691
5th row: 041-337-1730

Value | Count | Frequency (%)
041-337-7411 | 3 | 1.8%
041-338-5240 | 3 | 1.8%
000-0000-0000 | 3 | 1.8%
041-337-3881 | 3 | 1.8%
032-760-2114 | 3 | 1.8%
041-338-9691 | 2 | 1.2%
041-404-8040 | 2 | 1.2%
041-337-1730 | 2 | 1.2%
041-532-8195 | 2 | 1.2%
041-533-9322 | 2 | 1.2%
Other values (141) | 142 | 85.0%
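
The value 000-0000-0000 occurs three times and looks like a placeholder rather than a real number, even though the column reports no missing cells. A minimal sketch of how such entries could be screened before further use. The `classify_phone` helper is a hypothetical name, and the pattern (2 to 3 digit prefix, 3 to 4 digit middle group, 4 digit line number) is an assumption about the intended format, not a rule stated by the dataset:

```python
import re

# Assumed format: 2-3 digit prefix, 3-4 digit middle group, 4-digit line number.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")


def classify_phone(value):
    """Hypothetical helper: label a phone string as 'ok', 'placeholder', or 'malformed'."""
    if PHONE_PATTERN.fullmatch(value) is None:
        return "malformed"
    if set(value) <= {"0", "-"}:
        return "placeholder"  # all digits are zero
    return "ok"


# The first three values appear in this report; the last one is a made-up malformed example.
samples = ["02-3660-7821", "041-331-6390", "000-0000-0000", "0413316390"]
labels = {s: classify_phone(s) for s in samples}
print(labels)
```

Treating placeholders as missing would change the missing-cell count for this column from 0 to at least 3.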

Most occurring characters

Value | Count | Frequency (%)
- | 334 | 16.5%
0 | 326 | 16.1%
3 | 276 | 13.6%
1 | 252 | 12.5%
4 | 214 | 10.6%
7 | 131 | 6.5%
8 | 123 | 6.1%
2 | 109 | 5.4%
5 | 99 | 4.9%
6 | 88 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1689 | 83.5%
Dash Punctuation | 334 | 16.5%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
0 | 326 | 19.3%
3 | 276 | 16.3%
1 | 252 | 14.9%
4 | 214 | 12.7%
7 | 131 | 7.8%
8 | 123 | 7.3%
2 | 109 | 6.5%
5 | 99 | 5.9%
6 | 88 | 5.2%
9 | 71 | 4.2%

Dash Punctuation

Value | Count | Frequency (%)
- | 334 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2023 | 100.0%

Most frequent character per script

Common

Value | Count | Frequency (%)
- | 334 | 16.5%
0 | 326 | 16.1%
3 | 276 | 13.6%
1 | 252 | 12.5%
4 | 214 | 10.6%
7 | 131 | 6.5%
8 | 123 | 6.1%
2 | 109 | 5.4%
5 | 99 | 4.9%
6 | 88 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2023 | 100.0%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
- | 334 | 16.5%
0 | 326 | 16.1%
3 | 276 | 13.6%
1 | 252 | 12.5%
4 | 214 | 10.6%
7 | 131 | 6.5%
8 | 123 | 6.1%
2 | 109 | 5.4%
5 | 99 | 4.9%
6 | 88 | 4.3%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
2023-09-15: 167

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-09-15
2nd row: 2023-09-15
3rd row: 2023-09-15
4th row: 2023-09-15
5th row: 2023-09-15

Common Values

Value | Count | Frequency (%)
2023-09-15 | 167 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation plot]

Variable | 연번 | 산업단지명
연번 | 1.000 | 0.937
산업단지명 | 0.937 | 1.000

[Figure: correlation plot]

Variable | 연번 | 산업단지명
연번 | 1.000 | 0.898
산업단지명 | 0.898 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 연번 | 산업단지명 | 입주업체명 | 주소 | 전화번호 | 데이터기준일자
0 | 1 | 예산일반산업단지 | (주)경인양행 | 충청남도 예산군 응봉면 산단1길 7 | 02-3660-7821 | 2023-09-15
1 | 2 | 예산일반산업단지 | (주)나이스엘엠에스 주조공장 | 충청남도 예산군 삽교읍 산단1길 140, (예산일반산업단지) (삽교읍) | 041-331-6390 | 2023-09-15
2 | 3 | 예산일반산업단지 | (주)남영테크 | 충청남도 예산군 삽교읍 산단3길 16, (예산일반산업단지) | 041-338-9691 | 2023-09-15
3 | 4 | 예산일반산업단지 | (주)남영테크 | 충청남도 예산군 삽교읍 효림리 519-2 | 041-338-9691 | 2023-09-15
4 | 5 | 예산일반산업단지 | (주)네오오토 예산3공장 | 충청남도 예산군 삽교읍 산단2길 85 | 041-337-1730 | 2023-09-15
5 | 6 | 예산일반산업단지 | (주)네오오토 예산공장 | 충청남도 예산군 삽교읍 효림리 514번지 | 041-337-1730 | 2023-09-15
6 | 7 | 예산일반산업단지 | (주)대복식품 | 충청남도 예산군 삽교읍 산단3길 87, (예산일반산업단지내) 외 1필지 | 041-664-3976 | 2023-09-15
7 | 8 | 예산일반산업단지 | (주)대성비엘에스 | 충청남도 예산군 삽교읍 산단3길 226, 309호 309호 | 041-337-3881 | 2023-09-15
8 | 9 | 예산일반산업단지 | (주)더본코리아 | 충청남도 예산군 응봉면 산단1길 1 | 02-549-3864 | 2023-09-15
9 | 10 | 예산일반산업단지 | (주)동성화학 | 충청남도 예산군 삽교읍 효림리 174-10번지 예산일반산업단지 | 051-200-4571 | 2023-09-15

Last rows

Index | 연번 | 산업단지명 | 입주업체명 | 주소 | 전화번호 | 데이터기준일자
157 | 158 | 예산신소재일반산업단지 | (주)한민에코텍 | 충청남도 예산군 고덕면 상몽리 1025 외 1필지 | 041-532-8195 | 2023-09-15
158 | 159 | 예산신소재일반산업단지 | 대한금속공업(주) | 충청남도 예산군 고덕면 상몽산단1길 8 | 041-337-5211 | 2023-09-15
159 | 160 | 예산신소재일반산업단지 | 에이비이씨산업(주) | 충청남도 예산군 고덕면 상몽리 1020번지 | 052-255-9000 | 2023-09-15
160 | 161 | 예산신소재일반산업단지 | 에이치케이스틸 주식회사 | 충청남도 예산군 고덕면 상몽산단2길 12 | 041-404-7406 | 2023-09-15
161 | 162 | 예산신소재일반산업단지 | 유스코(주) | 충청남도 예산군 고덕면 상몽리 1014번지 | 041-6244-3533 | 2023-09-15
162 | 163 | 예산신소재일반산업단지 | 이엔지스틸(주) 예산1공장 | 충청남도 예산군 고덕면 상몽산단2길 51 외 2필지 | 041-533-9322 | 2023-09-15
163 | 164 | 예산신소재일반산업단지 | 이엔지스틸(주) 예산2공장 | 충청남도 예산군 고덕면 상몽산단2길 54 외 1필지 | 041-533-9322 | 2023-09-15
164 | 165 | 예산신소재일반산업단지 | 일신그린철강(주) | 충청남도 예산군 고덕면 상몽리 산 36-67번지 BL 3-4 | 02-6265-0303 | 2023-09-15
165 | 166 | 예산신소재일반산업단지 | 주식회사 씨케이텍 | 충청남도 예산군 고덕면 상몽산단2길 61 | 070-4651-4241 | 2023-09-15
166 | 167 | 예산신소재일반산업단지 | 플래티넘맥주(주) | 충청남도 예산군 고덕면 상몽산단2길 40 | 041-337-6003 | 2023-09-15