Overview

Dataset statistics

Number of variables8
Number of observations49
Missing cells7
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory67.7 B

Variable types

Categorical1
Text4
DateTime2
Numeric1

Dataset

Description충청남도 부여군 산업단지 내 입주기업 현황입니다.(산업단지명, 기업명, 사업장주소, 설립일자, 종업원수, 전화번호, 등)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=419&beforeMenuCd=DOM_000000201001001000&publicdatapk=15053311

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 7 (14.3%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:13:02.374211
Analysis finished2024-01-09 22:13:02.933521
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

산업단지명
Categorical

Distinct5
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
은산산업단지
20 
홍산산업단지
12 
은산2산업단지
10 
임천산업단지
장암산업단지
 
1

Length

Max length7
Median length6
Mean length6.2040816
Min length6

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row은산산업단지
2nd row은산산업단지
3rd row은산산업단지
4th row은산산업단지
5th row은산산업단지

Common Values

ValueCountFrequency (%)
은산산업단지 20
40.8%
홍산산업단지 12
24.5%
은산2산업단지 10
20.4%
임천산업단지 6
 
12.2%
장암산업단지 1
 
2.0%

Length

2024-01-10T07:13:02.981351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:03.057266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
은산산업단지 20
40.8%
홍산산업단지 12
24.5%
은산2산업단지 10
20.4%
임천산업단지 6
 
12.2%
장암산업단지 1
 
2.0%
Distinct47
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-10T07:13:03.233277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length5.9183673
Min length3

Characters and Unicode

Total characters290
Distinct characters118
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)91.8%

Sample

1st row㈜리버앤텍
2nd row㈜우리면
3rd row㈜정성면
4th row밤뜨래영농조합법인
5th row금산인삼농업협동조합
ValueCountFrequency (%)
밤뜨래영농조합법인 2
 
4.0%
㈜뉴제일이엘이씨 2
 
4.0%
준라이팅 1
 
2.0%
㈜리버앤텍 1
 
2.0%
에스원스틸㈜ 1
 
2.0%
㈜신화이엔지 1
 
2.0%
대한폴리텍㈜ 1
 
2.0%
한국유기농업개발㈜ 1
 
2.0%
㈜삼정아코텍 1
 
2.0%
㈜해송피엘 1
 
2.0%
Other values (38) 38
76.0%
2024-01-10T07:13:03.519346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
14.5%
13
 
4.5%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
Other values (108) 186
64.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 247
85.2%
Other Symbol 42
 
14.5%
Space Separator 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
5.3%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (106) 180
72.9%
Other Symbol
ValueCountFrequency (%)
42
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 289
99.7%
Common 1
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
14.5%
13
 
4.5%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
Other values (107) 185
64.0%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 247
85.2%
None 42
 
14.5%
ASCII 1
 
0.3%

Most frequent character per block

None
ValueCountFrequency (%)
42
100.0%
Hangul
ValueCountFrequency (%)
13
 
5.3%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (106) 180
72.9%
ASCII
ValueCountFrequency (%)
1
100.0%
Distinct36
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-10T07:13:03.684028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length20.795918
Min length19

Characters and Unicode

Total characters1019
Distinct characters34
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)57.1%

Sample

1st row충청남도 부여군 은산면 은산리 26
2nd row충청남도 부여군 은산면 은산리 16-14
3rd row충청남도 부여군 은산면 은산리 12
4th row충청남도 부여군 은산면 은산리 25-6
5th row충청남도 부여군 은산면 은산리 25-14
ValueCountFrequency (%)
충청남도 49
20.0%
부여군 49
20.0%
은산면 30
12.2%
은산리 20
8.2%
홍산면 12
 
4.9%
홍양리 12
 
4.9%
가중리 10
 
4.1%
임천면 6
 
2.4%
칠산리 6
 
2.4%
13 4
 
1.6%
Other values (37) 47
19.2%
2024-01-10T07:13:03.946685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
196
19.2%
68
 
6.7%
50
 
4.9%
49
 
4.8%
49
 
4.8%
49
 
4.8%
49
 
4.8%
49
 
4.8%
49
 
4.8%
49
 
4.8%
Other values (24) 362
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 637
62.5%
Space Separator 196
 
19.2%
Decimal Number 161
 
15.8%
Dash Punctuation 25
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
10.7%
50
 
7.8%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
Other values (12) 127
19.9%
Decimal Number
ValueCountFrequency (%)
5 27
16.8%
1 26
16.1%
0 22
13.7%
3 21
13.0%
2 21
13.0%
6 17
10.6%
7 11
6.8%
9 9
 
5.6%
8 4
 
2.5%
4 3
 
1.9%
Space Separator
ValueCountFrequency (%)
196
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 637
62.5%
Common 382
37.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
10.7%
50
 
7.8%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
Other values (12) 127
19.9%
Common
ValueCountFrequency (%)
196
51.3%
5 27
 
7.1%
1 26
 
6.8%
- 25
 
6.5%
0 22
 
5.8%
3 21
 
5.5%
2 21
 
5.5%
6 17
 
4.5%
7 11
 
2.9%
9 9
 
2.4%
Other values (2) 7
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 637
62.5%
ASCII 382
37.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
196
51.3%
5 27
 
7.1%
1 26
 
6.8%
- 25
 
6.5%
0 22
 
5.8%
3 21
 
5.5%
2 21
 
5.5%
6 17
 
4.5%
7 11
 
2.9%
9 9
 
2.4%
Other values (2) 7
 
1.8%
Hangul
ValueCountFrequency (%)
68
10.7%
50
 
7.8%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
49
 
7.7%
Other values (12) 127
19.9%
Distinct35
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-10T07:13:04.104939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length22.469388
Min length19

Characters and Unicode

Total characters1101
Distinct characters33
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)55.1%

Sample

1st row충청남도 부여군 은산면 충의로 604
2nd row충청남도 부여군 은산면 충의로 602-32
3rd row충청남도 부여군 은산면 충의로622번길 31
4th row충청남도 부여군 은산면 충의로 602-27
5th row충청남도 부여군 은산면 충의로 602-12
ValueCountFrequency (%)
충청남도 49
19.9%
부여군 49
19.9%
은산면 30
12.2%
홍산면 12
 
4.9%
비홍로 12
 
4.9%
충의로 11
 
4.5%
충의로622번길 9
 
3.7%
은남로20번길 9
 
3.7%
임천면 6
 
2.4%
부흥로171번길 6
 
2.4%
Other values (35) 53
21.5%
2024-01-10T07:13:04.369614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
197
17.9%
70
 
6.4%
59
 
5.4%
2 59
 
5.4%
55
 
5.0%
49
 
4.5%
49
 
4.5%
49
 
4.5%
49
 
4.5%
49
 
4.5%
Other values (23) 416
37.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 689
62.6%
Decimal Number 202
 
18.3%
Space Separator 197
 
17.9%
Dash Punctuation 13
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
10.2%
59
 
8.6%
55
 
8.0%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
42
 
6.1%
Other values (12) 169
24.5%
Decimal Number
ValueCountFrequency (%)
2 59
29.2%
0 29
14.4%
1 27
13.4%
6 24
11.9%
3 18
 
8.9%
7 16
 
7.9%
5 13
 
6.4%
4 10
 
5.0%
9 6
 
3.0%
Space Separator
ValueCountFrequency (%)
197
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 689
62.6%
Common 412
37.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
10.2%
59
 
8.6%
55
 
8.0%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
42
 
6.1%
Other values (12) 169
24.5%
Common
ValueCountFrequency (%)
197
47.8%
2 59
 
14.3%
0 29
 
7.0%
1 27
 
6.6%
6 24
 
5.8%
3 18
 
4.4%
7 16
 
3.9%
- 13
 
3.2%
5 13
 
3.2%
4 10
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 689
62.6%
ASCII 412
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
197
47.8%
2 59
 
14.3%
0 29
 
7.0%
1 27
 
6.6%
6 24
 
5.8%
3 18
 
4.4%
7 16
 
3.9%
- 13
 
3.2%
5 13
 
3.2%
4 10
 
2.4%
Hangul
ValueCountFrequency (%)
70
10.2%
59
 
8.6%
55
 
8.0%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
49
 
7.1%
42
 
6.1%
Other values (12) 169
24.5%
Distinct47
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum1990-12-24 00:00:00
Maximum2021-07-09 00:00:00
2024-01-10T07:13:04.473844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:13:04.572230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)

종업원수
Real number (ℝ)

Distinct27
Distinct (%)55.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.673469
Minimum1
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2024-01-10T07:13:04.668548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median10
Q321
95-th percentile45.2
Maximum90
Range89
Interquartile range (IQR)16

Descriptive statistics

Standard deviation17.416931
Coefficient of variation (CV)1.1112365
Kurtosis7.8041424
Mean15.673469
Median Absolute Deviation (MAD)6
Skewness2.5538091
Sum768
Variance303.34949
MonotonicityNot monotonic
2024-01-10T07:13:04.755845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
7 4
 
8.2%
2 4
 
8.2%
4 4
 
8.2%
6 3
 
6.1%
10 3
 
6.1%
5 3
 
6.1%
12 2
 
4.1%
11 2
 
4.1%
23 2
 
4.1%
1 2
 
4.1%
Other values (17) 20
40.8%
ValueCountFrequency (%)
1 2
4.1%
2 4
8.2%
3 1
 
2.0%
4 4
8.2%
5 3
6.1%
6 3
6.1%
7 4
8.2%
9 1
 
2.0%
10 3
6.1%
11 2
4.1%
ValueCountFrequency (%)
90 1
2.0%
72 1
2.0%
50 1
2.0%
38 1
2.0%
35 1
2.0%
29 1
2.0%
28 1
2.0%
26 2
4.1%
25 1
2.0%
23 2
4.1%

전화번호
Text

MISSING 

Distinct40
Distinct (%)95.2%
Missing7
Missing (%)14.3%
Memory size524.0 B
2024-01-10T07:13:04.918161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.047619
Min length12

Characters and Unicode

Total characters506
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)90.5%

Sample

1st row041-837-7522
2nd row041-832-1571
3rd row041-837-1571
4th row041-834-7700
5th row041-832-7884
ValueCountFrequency (%)
041-834-7700 2
 
4.8%
041-832-1571 2
 
4.8%
031-430-0369 1
 
2.4%
041-837-7522 1
 
2.4%
041-355-2851 1
 
2.4%
070-4348-2248 1
 
2.4%
041-834-8991 1
 
2.4%
041-833-5200 1
 
2.4%
031-798-1861 1
 
2.4%
031-976-3000 1
 
2.4%
Other values (30) 30
71.4%
2024-01-10T07:13:05.189518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 84
16.6%
0 74
14.6%
1 64
12.6%
4 59
11.7%
3 56
11.1%
8 51
10.1%
7 34
6.7%
2 32
 
6.3%
5 22
 
4.3%
6 19
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 422
83.4%
Dash Punctuation 84
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 74
17.5%
1 64
15.2%
4 59
14.0%
3 56
13.3%
8 51
12.1%
7 34
8.1%
2 32
7.6%
5 22
 
5.2%
6 19
 
4.5%
9 11
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 506
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 84
16.6%
0 74
14.6%
1 64
12.6%
4 59
11.7%
3 56
11.1%
8 51
10.1%
7 34
6.7%
2 32
 
6.3%
5 22
 
4.3%
6 19
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 506
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 84
16.6%
0 74
14.6%
1 64
12.6%
4 59
11.7%
3 56
11.1%
8 51
10.1%
7 34
6.7%
2 32
 
6.3%
5 22
 
4.3%
6 19
 
3.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2022-08-31 00:00:00
Maximum2022-08-31 00:00:00
2024-01-10T07:13:05.279341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:13:05.364249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T07:13:02.720906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:13:05.439115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호
산업단지명1.0000.8701.0001.0001.0000.6790.901
기업명0.8701.0000.8020.7080.9870.9920.998
사업장 주소(지번)1.0000.8021.0001.0000.9880.9760.899
사업장 주소(도로명)1.0000.7081.0001.0000.9870.9650.874
설립일자1.0000.9870.9880.9871.0001.0001.000
종업원수0.6790.9920.9760.9651.0001.0000.000
전화번호0.9010.9980.8990.8741.0000.0001.000
2024-01-10T07:13:05.550757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종업원수산업단지명
종업원수1.0000.484
산업단지명0.4841.000

Missing values

2024-01-10T07:13:02.801975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:13:02.895199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호데이터기준일자
0은산산업단지㈜리버앤텍충청남도 부여군 은산면 은산리 26충청남도 부여군 은산면 충의로 6042015-10-236041-837-75222022-08-31
1은산산업단지㈜우리면충청남도 부여군 은산면 은산리 16-14충청남도 부여군 은산면 충의로 602-322007-06-1829041-832-15712022-08-31
2은산산업단지㈜정성면충청남도 부여군 은산면 은산리 12충청남도 부여군 은산면 충의로622번길 312008-07-0113041-837-15712022-08-31
3은산산업단지밤뜨래영농조합법인충청남도 부여군 은산면 은산리 25-6충청남도 부여군 은산면 충의로 602-272005-11-0426041-834-77002022-08-31
4은산산업단지금산인삼농업협동조합충청남도 부여군 은산면 은산리 25-14충청남도 부여군 은산면 충의로 602-122003-05-1626041-832-78842022-08-31
5은산산업단지㈜비디텍충청남도 부여군 은산면 은산리 20충청남도 부여군 은산면 충의로 602-262019-01-0815041-837-44162022-08-31
6은산산업단지㈜뉴제일이엘이씨충청남도 부여군 은산면 은산리 12-10충청남도 부여군 은산면 충의로 622번길 352014-01-167041-837-01522022-08-31
7은산산업단지하나그린㈜충청남도 부여군 은산면 은산리 12-7충청남도 부여군 은산면 충의로622번길 302000-08-152041-837-01512022-08-31
8은산산업단지㈜맛푸름충청남도 부여군 은산면 은산리 15충청남도 부여군 은산면 충의로622번길 132016-05-167041-832-73052022-08-31
9은산산업단지백제라이팅충청남도 부여군 은산면 은산리 15충청남도 부여군 은산면 충의로622번길 132019-10-212041-833-01062022-08-31
산업단지명기업명사업장 주소(지번)사업장 주소(도로명)설립일자종업원수전화번호데이터기준일자
39홍산산업단지㈜비에스 부여공장충청남도 부여군 홍산면 홍양리 306-6충청남도 부여군 홍산면 비홍로 39-62006-05-1172041-836-10452022-08-31
40홍산산업단지㈜현호산업충청남도 부여군 홍산면 홍양리 306-5충청남도 부여군 홍산면 비홍로 39-202018-08-022<NA>2022-08-31
41홍산산업단지㈜에이시텍충청남도 부여군 홍산면 홍양리 306-7충청남도 부여군 홍산면 비홍로 492011-12-1412041-836-76132022-08-31
42홍산산업단지㈜케이지콘크리트충청남도 부여군 홍산면 홍양리 307-7충청남도 부여군 홍산면 비홍로 252020-03-092<NA>2022-08-31
43홍산산업단지㈜티제이콘크리트충청남도 부여군 홍산면 홍양리 307-4충청남도 부여군 홍산면 비홍로 272020-03-094041-837-83702022-08-31
44홍산산업단지동성이앤지㈜충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312003-07-2911041-832-12862022-08-31
45홍산산업단지메인스틸㈜충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312021-06-1510<NA>2022-08-31
46홍산산업단지준라이팅충청남도 부여군 홍산면 홍양리 307-1충청남도 부여군 홍산면 비홍로 312019-06-243041-833-92352022-08-31
47홍산산업단지㈜태산충청남도 부여군 홍산면 홍양리 306-1충청남도 부여군 홍산면 비홍로 552019-10-011202-2602-52562022-08-31
48홍산산업단지㈜에스엠산업충청남도 부여군 홍산면 홍양리 306-1충청남도 부여군 홍산면 비홍로 552020-05-046041-832-11492022-08-31