Overview

Dataset statistics

Number of variables8
Number of observations48
Missing cells7
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory67.8 B

Variable types

Categorical1
Text4
DateTime2
Numeric1

Dataset

Description충청남도 부여군 산업단지 내 입주기업 현황입니다.(산업단지명, 기업명, 사업장주소, 설립일자, 종업원수, 전화번호, 등)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=419&beforeMenuCd=DOM_000000201001001000&publicdatapk=15053311

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 7 (14.6%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:12:54.299731
Analysis finished2024-01-09 22:12:54.888044
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

산업단지명
Categorical

Distinct5
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Memory size516.0 B
은산산업단지
19 
홍산산업단지
12 
은산2산업단지
임천산업단지
장암산업단지
 
1

Length

Max length7
Median length6
Mean length6.1875
Min length6

Unique

Unique1 ?
Unique (%)2.1%

Sample

1st row은산산업단지
2nd row은산산업단지
3rd row은산산업단지
4th row은산산업단지
5th row은산산업단지

Common Values

ValueCountFrequency (%)
은산산업단지 19
39.6%
홍산산업단지 12
25.0%
은산2산업단지 9
18.8%
임천산업단지 7
 
14.6%
장암산업단지 1
 
2.1%

Length

2024-01-10T07:12:54.937434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:12:55.021522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
은산산업단지 19
39.6%
홍산산업단지 12
25.0%
은산2산업단지 9
18.8%
임천산업단지 7
 
14.6%
장암산업단지 1
 
2.1%
Distinct46
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-10T07:12:55.190343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length6.0833333
Min length3

Characters and Unicode

Total characters292
Distinct characters119
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)91.7%

Sample

1st row㈜리버앤텍
2nd row㈜우리면
3rd row㈜정성면
4th row밤뜨래영농조합법인
5th row금산인삼농업협동조합
ValueCountFrequency (%)
밤뜨래영농조합법인 2
 
3.9%
㈜뉴제일이엘이씨 2
 
3.9%
주식회사 2
 
3.9%
㈜리버앤텍 1
 
2.0%
에스원스틸㈜ 1
 
2.0%
대한폴리텍㈜ 1
 
2.0%
한국유기농업개발㈜ 1
 
2.0%
㈜삼정아코텍 1
 
2.0%
㈜해송피엘 1
 
2.0%
아이솔라에너지㈜ 1
 
2.0%
Other values (38) 38
74.5%
2024-01-10T07:12:55.465184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
13.4%
15
 
5.1%
8
 
2.7%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (109) 190
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 250
85.6%
Other Symbol 39
 
13.4%
Space Separator 3
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.0%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (107) 182
72.8%
Other Symbol
ValueCountFrequency (%)
39
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 289
99.0%
Common 3
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
13.5%
15
 
5.2%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (108) 187
64.7%
Common
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 250
85.6%
None 39
 
13.4%
ASCII 3
 
1.0%

Most frequent character per block

None
ValueCountFrequency (%)
39
100.0%
Hangul
ValueCountFrequency (%)
15
 
6.0%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (107) 182
72.8%
ASCII
ValueCountFrequency (%)
3
100.0%
Distinct37
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-10T07:12:55.628413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length20.8125
Min length19

Characters and Unicode

Total characters999
Distinct characters34
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)62.5%

Sample

1st row충청남도 부여군 은산면 은산리 26
2nd row충청남도 부여군 은산면 은산리 16-14
3rd row충청남도 부여군 은산면 은산리 12
4th row충청남도 부여군 은산면 은산리 25-6
5th row충청남도 부여군 은산면 은산리 25-14
ValueCountFrequency (%)
충청남도 48
20.0%
부여군 48
20.0%
은산면 28
11.7%
은산리 19
 
7.9%
홍산면 12
 
5.0%
홍양리 12
 
5.0%
가중리 9
 
3.8%
임천면 7
 
2.9%
칠산리 7
 
2.9%
13 4
 
1.7%
Other values (38) 46
19.2%
2024-01-10T07:12:55.887370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
192
19.2%
66
 
6.6%
48
 
4.8%
48
 
4.8%
48
 
4.8%
48
 
4.8%
48
 
4.8%
48
 
4.8%
48
 
4.8%
48
 
4.8%
Other values (24) 357
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 624
62.5%
Space Separator 192
 
19.2%
Decimal Number 158
 
15.8%
Dash Punctuation 25
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
10.6%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
Other values (12) 126
20.2%
Decimal Number
ValueCountFrequency (%)
1 27
17.1%
5 25
15.8%
0 23
14.6%
3 21
13.3%
2 20
12.7%
6 16
10.1%
7 11
7.0%
9 9
 
5.7%
8 3
 
1.9%
4 3
 
1.9%
Space Separator
ValueCountFrequency (%)
192
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 624
62.5%
Common 375
37.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
10.6%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
Other values (12) 126
20.2%
Common
ValueCountFrequency (%)
192
51.2%
1 27
 
7.2%
- 25
 
6.7%
5 25
 
6.7%
0 23
 
6.1%
3 21
 
5.6%
2 20
 
5.3%
6 16
 
4.3%
7 11
 
2.9%
9 9
 
2.4%
Other values (2) 6
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 624
62.5%
ASCII 375
37.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
192
51.2%
1 27
 
7.2%
- 25
 
6.7%
5 25
 
6.7%
0 23
 
6.1%
3 21
 
5.6%
2 20
 
5.3%
6 16
 
4.3%
7 11
 
2.9%
9 9
 
2.4%
Other values (2) 6
 
1.6%
Hangul
ValueCountFrequency (%)
66
10.6%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
48
 
7.7%
Other values (12) 126
20.2%
Distinct36
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-10T07:12:56.048145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length22.479167
Min length19

Characters and Unicode

Total characters1079
Distinct characters33
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)60.4%

Sample

1st row충청남도 부여군 은산면 충의로 604
2nd row충청남도 부여군 은산면 충의로 602-32
3rd row충청남도 부여군 은산면 충의로622번길 31
4th row충청남도 부여군 은산면 충의로 602-27
5th row충청남도 부여군 은산면 충의로 602-12
ValueCountFrequency (%)
충청남도 48
19.9%
부여군 48
19.9%
은산면 28
11.6%
홍산면 12
 
5.0%
비홍로 12
 
5.0%
충의로 10
 
4.1%
충의로622번길 9
 
3.7%
은남로20번길 8
 
3.3%
임천면 7
 
2.9%
부흥로171번길 7
 
2.9%
Other values (36) 52
21.6%
2024-01-10T07:12:56.306798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
193
17.9%
68
 
6.3%
57
 
5.3%
2 57
 
5.3%
55
 
5.1%
48
 
4.4%
48
 
4.4%
48
 
4.4%
48
 
4.4%
48
 
4.4%
Other values (23) 409
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 676
62.7%
Decimal Number 198
 
18.4%
Space Separator 193
 
17.9%
Dash Punctuation 12
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
10.1%
57
 
8.4%
55
 
8.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
40
 
5.9%
Other values (12) 168
24.9%
Decimal Number
ValueCountFrequency (%)
2 57
28.8%
1 29
14.6%
0 26
13.1%
6 23
11.6%
3 18
 
9.1%
7 16
 
8.1%
5 13
 
6.6%
4 10
 
5.1%
9 6
 
3.0%
Space Separator
ValueCountFrequency (%)
193
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 676
62.7%
Common 403
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
10.1%
57
 
8.4%
55
 
8.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
40
 
5.9%
Other values (12) 168
24.9%
Common
ValueCountFrequency (%)
193
47.9%
2 57
 
14.1%
1 29
 
7.2%
0 26
 
6.5%
6 23
 
5.7%
3 18
 
4.5%
7 16
 
4.0%
5 13
 
3.2%
- 12
 
3.0%
4 10
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 676
62.7%
ASCII 403
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
193
47.9%
2 57
 
14.1%
1 29
 
7.2%
0 26
 
6.5%
6 23
 
5.7%
3 18
 
4.5%
7 16
 
4.0%
5 13
 
3.2%
- 12
 
3.0%
4 10
 
2.5%
Hangul
ValueCountFrequency (%)
68
10.1%
57
 
8.4%
55
 
8.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
48
 
7.1%
40
 
5.9%
Other values (12) 168
24.9%
Distinct46
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size516.0 B
Minimum1990-12-24 00:00:00
Maximum2023-10-30 00:00:00
2024-01-10T07:12:56.412218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:12:56.511870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)

종업원수
Real number (ℝ)

Distinct27
Distinct (%)56.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.791667
Minimum1
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-01-10T07:12:56.601013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median11
Q321.5
95-th percentile63
Maximum90
Range89
Interquartile range (IQR)16.5

Descriptive statistics

Standard deviation18.925339
Coefficient of variation (CV)1.1270673
Kurtosis5.8539189
Mean16.791667
Median Absolute Deviation (MAD)7
Skewness2.3619676
Sum806
Variance358.16844
MonotonicityNot monotonic
2024-01-10T07:12:56.688325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
7 4
 
8.3%
2 4
 
8.3%
4 4
 
8.3%
6 3
 
6.2%
5 3
 
6.2%
10 2
 
4.2%
14 2
 
4.2%
11 2
 
4.2%
23 2
 
4.2%
12 2
 
4.2%
Other values (17) 20
41.7%
ValueCountFrequency (%)
1 1
 
2.1%
2 4
8.3%
3 1
 
2.1%
4 4
8.3%
5 3
6.2%
6 3
6.2%
7 4
8.3%
9 1
 
2.1%
10 2
4.2%
11 2
4.2%
ValueCountFrequency (%)
90 1
2.1%
72 1
2.1%
70 1
2.1%
50 1
2.1%
38 1
2.1%
29 1
2.1%
28 1
2.1%
26 2
4.2%
25 1
2.1%
23 2
4.2%

전화번호
Text

MISSING 

Distinct39
Distinct (%)95.1%
Missing7
Missing (%)14.6%
Memory size516.0 B
2024-01-10T07:12:56.849225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.04878
Min length12

Characters and Unicode

Total characters494
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)90.2%

Sample

1st row041-837-7522
2nd row041-832-1571
3rd row041-837-1571
4th row041-834-7700
5th row041-832-7884
ValueCountFrequency (%)
041-834-7700 2
 
4.9%
041-832-1571 2
 
4.9%
041-832-1286 1
 
2.4%
041-837-8370 1
 
2.4%
02-2602-5256 1
 
2.4%
041-833-9235 1
 
2.4%
041-408-7933 1
 
2.4%
041-355-2851 1
 
2.4%
070-4348-2248 1
 
2.4%
041-834-8991 1
 
2.4%
Other values (29) 29
70.7%
2024-01-10T07:12:57.122987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 82
16.6%
0 72
14.6%
1 63
12.8%
4 58
11.7%
3 55
11.1%
8 50
10.1%
7 33
6.7%
2 29
 
5.9%
5 22
 
4.5%
6 19
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 412
83.4%
Dash Punctuation 82
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 72
17.5%
1 63
15.3%
4 58
14.1%
3 55
13.3%
8 50
12.1%
7 33
8.0%
2 29
7.0%
5 22
 
5.3%
6 19
 
4.6%
9 11
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 494
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 82
16.6%
0 72
14.6%
1 63
12.8%
4 58
11.7%
3 55
11.1%
8 50
10.1%
7 33
6.7%
2 29
 
5.9%
5 22
 
4.5%
6 19
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 494
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 82
16.6%
0 72
14.6%
1 63
12.8%
4 58
11.7%
3 55
11.1%
8 50
10.1%
7 33
6.7%
2 29
 
5.9%
5 22
 
4.5%
6 19
 
3.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
Minimum2023-10-31 00:00:00
Maximum2023-10-31 00:00:00
2024-01-10T07:12:57.212929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:12:57.287003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T07:12:54.678737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:12:57.341526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호
산업단지명1.0000.8501.0001.0001.0000.6750.861
기업명0.8501.0000.8750.8380.9860.9910.998
사업장 주소(지번)1.0000.8751.0001.0000.9900.9680.912
사업장 주소(도로명)1.0000.8381.0001.0000.9890.9640.892
설립일자1.0000.9860.9900.9891.0001.0001.000
종업원수0.6750.9910.9680.9641.0001.0000.000
전화번호0.8610.9980.9120.8921.0000.0001.000
2024-01-10T07:12:57.424028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종업원수산업단지명
종업원수1.0000.479
산업단지명0.4791.000

Missing values

2024-01-10T07:12:54.756844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:12:54.849581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호데이터기준일자
0은산산업단지㈜리버앤텍충청남도 부여군 은산면 은산리 26충청남도 부여군 은산면 충의로 6042015-10-236041-837-75222023-10-31
1은산산업단지㈜우리면충청남도 부여군 은산면 은산리 16-14충청남도 부여군 은산면 충의로 602-322007-06-1829041-832-15712023-10-31
2은산산업단지㈜정성면충청남도 부여군 은산면 은산리 12충청남도 부여군 은산면 충의로622번길 312008-07-0113041-837-15712023-10-31
3은산산업단지밤뜨래영농조합법인충청남도 부여군 은산면 은산리 25-6충청남도 부여군 은산면 충의로 602-272005-11-0426041-834-77002023-10-31
4은산산업단지금산인삼농업협동조합충청남도 부여군 은산면 은산리 25-14충청남도 부여군 은산면 충의로 602-122003-05-1626041-832-78842023-10-31
5은산산업단지㈜비디텍충청남도 부여군 은산면 은산리 20충청남도 부여군 은산면 충의로 602-262019-01-0815041-837-44162023-10-31
6은산산업단지㈜뉴제일이엘이씨충청남도 부여군 은산면 은산리 12-10충청남도 부여군 은산면 충의로 622번길 352014-01-167041-837-01522023-10-31
7은산산업단지하나그린㈜충청남도 부여군 은산면 은산리 12-7충청남도 부여군 은산면 충의로622번길 302000-08-152041-837-01512023-10-31
8은산산업단지㈜맛푸름충청남도 부여군 은산면 은산리 15충청남도 부여군 은산면 충의로622번길 132016-05-167041-832-73052023-10-31
9은산산업단지백제라이팅충청남도 부여군 은산면 은산리 15충청남도 부여군 은산면 충의로622번길 132019-10-212041-833-01062023-10-31
산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호데이터기준일자
38홍산산업단지㈜비에스 부여공장충청남도 부여군 홍산면 홍양리 306-6충청남도 부여군 홍산면 비홍로 39-62006-05-1172041-836-10452023-10-31
39홍산산업단지㈜현호산업충청남도 부여군 홍산면 홍양리 306-5충청남도 부여군 홍산면 비홍로 39-202018-08-022<NA>2023-10-31
40홍산산업단지㈜에이시텍충청남도 부여군 홍산면 홍양리 306-7충청남도 부여군 홍산면 비홍로 492011-12-1412041-836-76132023-10-31
41홍산산업단지㈜케이지콘크리트충청남도 부여군 홍산면 홍양리 307-7충청남도 부여군 홍산면 비홍로 252020-03-092<NA>2023-10-31
42홍산산업단지㈜티제이콘크리트충청남도 부여군 홍산면 홍양리 307-4충청남도 부여군 홍산면 비홍로 272020-03-094041-837-83702023-10-31
43홍산산업단지동성이앤지㈜충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312003-07-2911041-832-12862023-10-31
44홍산산업단지메인스틸㈜충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312021-06-1510<NA>2023-10-31
45홍산산업단지준라이팅충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312019-06-243041-833-92352023-10-31
46홍산산업단지㈜태산충청남도 부여군 홍산면 홍양리 306-1충청남도 부여군 홍산면 비홍로 552019-10-011202-2602-52562023-10-31
47홍산산업단지㈜에스엠산업충청남도 부여군 홍산면 홍양리 306-1충청남도 부여군 홍산면 비홍로 552020-05-046041-832-11492023-10-31