Overview

Dataset statistics

Number of variables: 6
Number of observations: 142
Missing cells: 8
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.9 KiB
Average record size in memory: 49.9 B
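
The missing-cell percentage can be checked by hand. The sketch below assumes it is the missing-cell count divided by the total cell count (variables × observations); the function name is illustrative and not part of the profiling tool.

```python
def missing_cell_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of missing cells among all cells, in percent."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


# Figures taken from the dataset statistics: 8 missing cells, 6 variables, 142 rows.
pct = missing_cell_pct(8, 6, 142)
print(f"{pct:.1f}%")
```

With 8 missing cells out of 6 × 142 = 852, the result rounds to the reported 0.9%.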

Variable types

Categorical: 1
Text: 2
DateTime: 2
Numeric: 1

Dataset

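Description: Data on companies located in the industrial complexes of Gongju-si, Chungcheongnam-do. It provides fields such as industrial complex name (산업단지명), company name (기업명), establishment date (설립일자), and number of employees (종업원수).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=419&beforeMenuCd=DOM_000000201001001000&publicdatapk=15028943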

Alerts

데이터기준일 (data reference date) has a constant value. [Constant]
전화번호 (phone number) has 8 (5.6%) missing values. [Missing]
종업원수 (number of employees) has 4 (2.8%) zeros. [Zeros]
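
As a rough illustration of how alerts like these could be derived, the sketch below flags constant, missing, and zero values for a single column. The function `column_alerts`, its output strings, and the placeholder column values are assumptions made for this example; they are not the profiling tool's actual implementation or the real data.

```python
from typing import Optional, Sequence


def column_alerts(values: Sequence[Optional[object]]) -> list[str]:
    """Return alert labels for one column; None marks a missing value."""
    alerts = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    # Constant: every non-missing value is identical.
    if present and len(set(present)) == 1:
        alerts.append("Constant")
    if n_missing > 0:
        alerts.append(f"Missing: {n_missing} ({100 * n_missing / len(values):.1f}%)")
    # Zeros: numeric values equal to zero.
    n_zeros = sum(1 for v in present if isinstance(v, (int, float)) and v == 0)
    if n_zeros > 0:
        alerts.append(f"Zeros: {n_zeros} ({100 * n_zeros / len(values):.1f}%)")
    return alerts


# Placeholder columns with the same shape as the report (142 rows each).
dates = ["2018-02-28"] * 142
phones = [None] * 8 + [f"041-000-{i:04d}" for i in range(134)]
employees = [0] * 4 + list(range(1, 139))

print(column_alerts(dates))
print(column_alerts(phones))
print(column_alerts(employees))
```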

Reproduction

Analysis started: 2024-01-09 21:25:35.941804
Analysis finished: 2024-01-09 21:25:36.330185
Duration: 0.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
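
The reported duration follows from the two timestamps. A minimal check using only the standard library (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps above

started = datetime.strptime("2024-01-09 21:25:35.941804", FMT)
finished = datetime.strptime("2024-01-09 21:25:36.330185", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```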

Variables

산업단지명
Categorical

Distinct: 13
Distinct (%): 9.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Value | Count
공주검상농공단지 | 29
공주장기농공단지 | 22
공주탄천일반산업단지 | 21
공주정안2농공단지 | 17
공주월미농공단지 | 14
Other values (8) | 39

Length

Max length: 13
Median length: 8
Mean length: 8.8661972
Min length: 8

Unique

Unique: 1
Unique (%): 0.7%
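
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates the difference on a made-up list; the helper name and sample labels are hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical labels: "A" and "B" repeat, only "C" appears once.
sample = ["A", "A", "B", "B", "B", "C"]
print(distinct_and_unique(sample))
```

For 산업단지명, the same reading gives 13 different complex names, of which only one appears on a single row.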

Sample

1st row: 공주검상농공단지
2nd row: 공주검상농공단지
3rd row: 공주검상농공단지
4th row: 공주검상농공단지
5th row: 공주검상농공단지

Common Values

Value | Count | Frequency (%)
공주검상농공단지 | 29 | 20.4%
공주장기농공단지 | 22 | 15.5%
공주탄천일반산업단지 | 21 | 14.8%
공주정안2농공단지 | 17 | 12.0%
공주월미농공단지 | 14 | 9.9%
공주정안농공단지 | 12 | 8.5%
공주유구자카드일반산업단지 | 7 | 4.9%
공주우성(전문)농공단지 | 5 | 3.5%
공주월미2농공단지 | 5 | 3.5%
공주보물농공단지 | 4 | 2.8%
Other values (3) | 6 | 4.2%
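
The frequency column is each count divided by the number of observations. The sketch below recomputes it from the counts listed above; the helper `frequency_pct` is an illustrative name, not part of the report.

```python
# Counts copied from the Common Values table of 산업단지명.
counts = {
    "공주검상농공단지": 29,
    "공주장기농공단지": 22,
    "공주탄천일반산업단지": 21,
    "공주정안2농공단지": 17,
    "공주월미농공단지": 14,
    "공주정안농공단지": 12,
    "공주유구자카드일반산업단지": 7,
    "공주우성(전문)농공단지": 5,
    "공주월미2농공단지": 5,
    "공주보물농공단지": 4,
    "Other values (3)": 6,
}
total = sum(counts.values())  # should equal the number of observations


def frequency_pct(count: int, total: int) -> str:
    """Format a count as a percentage of the total with one decimal place."""
    return f"{100 * count / total:.1f}%"


for name, count in counts.items():
    print(name, count, frequency_pct(count, total))
```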

Length

[Figure: histogram of lengths of the category]

기업명
Text

Distinct: 140
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 18
Median length: 14
Mean length: 8.6338028
Min length: 3

Characters and Unicode

Total characters: 1226
Distinct characters: 209
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
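
To make the category counts concrete, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module, using two company names from this report's sample. The helper name is illustrative, and the report itself may compute these statistics differently.

```python
import unicodedata
from collections import Counter


def category_counts(strings):
    """Count characters by Unicode general category (e.g. 'Lo', 'Ps', 'Zs')."""
    counts = Counter()
    for s in strings:
        for ch in s:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two company names taken from the sample rows of 기업명.
names = ["(주) 피엔제이 생활건강", "(주)고려헬스팜"]
counts = category_counts(names)
print(counts)
```

The two-letter codes map to the names used in the report: `Lo` is Other Letter (Hangul syllables fall here), `Ps` is Open Punctuation, `Pe` is Close Punctuation, `Zs` is Space Separator, and `Nd` is Decimal Number.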

Unique

Unique: 138
Unique (%): 97.2%

Sample

1st row: (주) 피엔제이 생활건강
2nd row: (주)고려헬스팜
3rd row: (주)수안산업
4th row: (주)신일팜글라스
5th row: (주)에스피

Value | Count | Frequency (%)
주식회사 | 12 | 6.7%
공주 | 4 | 2.2%
동원시스템즈(주 | 4 | 2.2%
제2공장 | 3 | 1.7%
주)한일 | 3 | 1.7%
솔브레인(주 | 3 | 1.7%
[blank] | 2 | 1.1%
엠씨솔루션(주 | 2 | 1.1%
공주공장 | 2 | 1.1%
주)뉴올 | 2 | 1.1%
Other values (138) | 141 | 79.2%

Most occurring characters

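Value | Count | Frequency (%)
[lost in extraction] | 134 | 10.9%
( | 106 | 8.6%
) | 106 | 8.6%
[lost in extraction] | 40 | 3.3%
[lost in extraction] | 37 | 3.0%
[lost in extraction] | 37 | 3.0%
[lost in extraction] | 28 | 2.3%
[lost in extraction] | 25 | 2.0%
[lost in extraction] | 21 | 1.7%
[lost in extraction] | 20 | 1.6%
Other values (199) | 672 | 54.8%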

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 946 | 77.2%
Open Punctuation | 106 | 8.6%
Close Punctuation | 106 | 8.6%
Space Separator | 37 | 3.0%
Decimal Number | 16 | 1.3%
Uppercase Letter | 12 | 1.0%
Other Punctuation | 2 | 0.2%
Lowercase Letter | 1 | 0.1%

Most frequent character per category

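Other Letter
Value | Count | Frequency (%)
[lost in extraction] | 134 | 14.2%
[lost in extraction] | 40 | 4.2%
[lost in extraction] | 37 | 3.9%
[lost in extraction] | 28 | 3.0%
[lost in extraction] | 25 | 2.6%
[lost in extraction] | 21 | 2.2%
[lost in extraction] | 20 | 2.1%
[lost in extraction] | 19 | 2.0%
[lost in extraction] | 18 | 1.9%
[lost in extraction] | 18 | 1.9%
Other values (179) | 586 | 61.9%

Uppercase Letter
Value | Count | Frequency (%)
O | 2 | 16.7%
K | 2 | 16.7%
S | 2 | 16.7%
G | 2 | 16.7%
F | 1 | 8.3%
B | 1 | 8.3%
J | 1 | 8.3%
L | 1 | 8.3%

Decimal Number
Value | Count | Frequency (%)
2 | 7 | 43.8%
1 | 3 | 18.8%
3 | 3 | 18.8%
7 | 1 | 6.2%
4 | 1 | 6.2%
6 | 1 | 6.2%

Other Punctuation
Value | Count | Frequency (%)
· | 1 | 50.0%
& | 1 | 50.0%

Open Punctuation
Value | Count | Frequency (%)
( | 106 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 106 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 37 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
n | 1 | 100.0%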

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 946 | 77.2%
Common | 267 | 21.8%
Latin | 13 | 1.1%

Most frequent character per script

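Hangul
Value | Count | Frequency (%)
[lost in extraction] | 134 | 14.2%
[lost in extraction] | 40 | 4.2%
[lost in extraction] | 37 | 3.9%
[lost in extraction] | 28 | 3.0%
[lost in extraction] | 25 | 2.6%
[lost in extraction] | 21 | 2.2%
[lost in extraction] | 20 | 2.1%
[lost in extraction] | 19 | 2.0%
[lost in extraction] | 18 | 1.9%
[lost in extraction] | 18 | 1.9%
Other values (179) | 586 | 61.9%

Common
Value | Count | Frequency (%)
( | 106 | 39.7%
) | 106 | 39.7%
(space) | 37 | 13.9%
2 | 7 | 2.6%
1 | 3 | 1.1%
3 | 3 | 1.1%
7 | 1 | 0.4%
· | 1 | 0.4%
& | 1 | 0.4%
4 | 1 | 0.4%

Latin
Value | Count | Frequency (%)
O | 2 | 15.4%
K | 2 | 15.4%
S | 2 | 15.4%
G | 2 | 15.4%
F | 1 | 7.7%
B | 1 | 7.7%
n | 1 | 7.7%
J | 1 | 7.7%
L | 1 | 7.7%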

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 946 | 77.2%
ASCII | 279 | 22.8%
None | 1 | 0.1%

Most frequent character per block

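Hangul
Value | Count | Frequency (%)
[lost in extraction] | 134 | 14.2%
[lost in extraction] | 40 | 4.2%
[lost in extraction] | 37 | 3.9%
[lost in extraction] | 28 | 3.0%
[lost in extraction] | 25 | 2.6%
[lost in extraction] | 21 | 2.2%
[lost in extraction] | 20 | 2.1%
[lost in extraction] | 19 | 2.0%
[lost in extraction] | 18 | 1.9%
[lost in extraction] | 18 | 1.9%
Other values (179) | 586 | 61.9%

ASCII
Value | Count | Frequency (%)
( | 106 | 38.0%
) | 106 | 38.0%
(space) | 37 | 13.3%
2 | 7 | 2.5%
1 | 3 | 1.1%
3 | 3 | 1.1%
O | 2 | 0.7%
K | 2 | 0.7%
S | 2 | 0.7%
G | 2 | 0.7%
Other values (9) | 9 | 3.2%

None
Value | Count | Frequency (%)
· | 1 | 100.0%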

설립일자
Date

Distinct: 139
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
Minimum: 1987-06-21 00:00:00
Maximum: 2017-12-26 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

종업원수
Real number (ℝ)

ZEROS 

Distinct: 60
Distinct (%): 42.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 47.457746
Minimum: 0
Maximum: 1550
Zeros: 4
Zeros (%): 2.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 0
5th percentile: 3
Q1: 8
Median: 15
Q3: 32
95th percentile: 119.65
Maximum: 1550
Range: 1550
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation: 155.22007
Coefficient of variation (CV): 3.2707005
Kurtosis: 69.774479
Mean: 47.457746
Median Absolute Deviation (MAD): 10
Skewness: 7.933277
Sum: 6739
Variance: 24093.271
Monotonicity: Not monotonic
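
Several of the figures above are derived from others by standard definitions. The sketch below recomputes them from the reported values as a consistency check; small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the quantile and descriptive statistics of 종업원수.
n_observations = 142
minimum, maximum = 0, 1550
q1, q3 = 8, 32
total_sum = 6739
std_dev = 155.22007

value_range = maximum - minimum    # Range
iqr = q3 - q1                      # Interquartile range
mean = total_sum / n_observations  # Mean = Sum / number of observations
variance = std_dev ** 2            # Variance = standard deviation squared
cv = std_dev / mean                # Coefficient of variation = std / mean

print(value_range, iqr, round(mean, 6), round(variance, 3), round(cv, 4))
```

A CV above 3 together with a median of 15 and a maximum of 1550 indicates a strongly right-skewed distribution, which matches the reported skewness.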

[Figure: histogram with fixed size bins (bins=50)]
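
A fixed-size-bin histogram splits the value range into equally wide intervals and counts the values in each. The sketch below is one simple way to do that in plain Python; it is not the plotting code used by the report, and the input is only the zero minimum plus the ten largest 종업원수 values listed in this report, not the full column.

```python
def fixed_bin_histogram(values, bins):
    """Count values in `bins` equally wide intervals spanning min..max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Degenerate case: all values identical, so everything goes in one bin.
        return [len(values)] + [0] * (bins - 1), 0.0
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value falls on the right edge; clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts, width


# Partial input: the minimum (0) and the ten largest values from the report.
values = [0, 1550, 928, 372, 285, 200, 164, 129, 120, 113, 110]
counts, width = fixed_bin_histogram(values, bins=50)
print(width)
print([(i, c) for i, c in enumerate(counts) if c])
```

With a range of 0 to 1550 and 50 bins, each bin is 31 employees wide, so most companies (median 15) land in the first bin while the few large employers are spread thinly across the rest.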
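
Most frequent values
Value | Count | Frequency (%)
10 | 10 | 7.0%
12 | 8 | 5.6%
20 | 6 | 4.2%
4 | 6 | 4.2%
5 | 6 | 4.2%
8 | 6 | 4.2%
15 | 6 | 4.2%
3 | 5 | 3.5%
18 | 5 | 3.5%
6 | 4 | 2.8%
Other values (50) | 80 | 56.3%

10 smallest values
Value | Count | Frequency (%)
0 | 4 | 2.8%
2 | 2 | 1.4%
3 | 5 | 3.5%
4 | 6 | 4.2%
5 | 6 | 4.2%
6 | 4 | 2.8%
7 | 4 | 2.8%
8 | 6 | 4.2%
9 | 3 | 2.1%
10 | 10 | 7.0%

10 largest values
Value | Count | Frequency (%)
1550 | 1 | 0.7%
928 | 1 | 0.7%
372 | 1 | 0.7%
285 | 1 | 0.7%
200 | 1 | 0.7%
164 | 1 | 0.7%
129 | 1 | 0.7%
120 | 1 | 0.7%
113 | 1 | 0.7%
110 | 1 | 0.7%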
0.7%

전화번호
Text

MISSING 

Distinct: 121
Distinct (%): 90.3%
Missing: 8
Missing (%): 5.6%
Memory size: 1.2 KiB
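
The percentages for 전화번호 appear to use two different denominators: the missing share is relative to all 142 rows, while the distinct and unique shares are consistent with being relative to the 134 non-missing values. This is inferred from the figures, not stated by the report. A quick check:

```python
# Figures copied from the report for 전화번호.
n_observations = 142
n_missing = 8
n_distinct = 121
n_unique = 113

n_present = n_observations - n_missing           # non-missing values
missing_pct = 100 * n_missing / n_observations   # relative to all rows
distinct_pct = 100 * n_distinct / n_present      # relative to non-missing rows (assumed)
unique_pct = 100 * n_unique / n_present          # relative to non-missing rows (assumed)

print(f"{missing_pct:.1f}% {distinct_pct:.1f}% {unique_pct:.1f}%")
```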

Length

Max length: 13
Median length: 12
Mean length: 11.992537
Min length: 11

Characters and Unicode

Total characters: 1607
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 113
Unique (%): 84.3%

Sample

1st row: 041-853-1995
2nd row: 041-852-1041
3rd row: 041-855-1727
4th row: 041-852-9037
5th row: 041-854-8593

Value | Count | Frequency (%)
041-852-1636 | 7 | 5.2%
041-854-2303 | 2 | 1.5%
041-853-8070 | 2 | 1.5%
041-881-9604 | 2 | 1.5%
041-852-3319 | 2 | 1.5%
041-858-8742 | 2 | 1.5%
041-855-1780 | 2 | 1.5%
041-840-0550 | 2 | 1.5%
041-556-5613 | 1 | 0.7%
02-2113-7718 | 1 | 0.7%
Other values (111) | 111 | 82.8%

Most occurring characters

Value | Count | Frequency (%)
- | 268 | 16.7%
0 | 248 | 15.4%
1 | 211 | 13.1%
4 | 181 | 11.3%
8 | 166 | 10.3%
5 | 137 | 8.5%
3 | 112 | 7.0%
2 | 97 | 6.0%
7 | 70 | 4.4%
6 | 64 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1339 | 83.3%
Dash Punctuation | 268 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 248 | 18.5%
1 | 211 | 15.8%
4 | 181 | 13.5%
8 | 166 | 12.4%
5 | 137 | 10.2%
3 | 112 | 8.4%
2 | 97 | 7.2%
7 | 70 | 5.2%
6 | 64 | 4.8%
9 | 53 | 4.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 268 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1607 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 268 | 16.7%
0 | 248 | 15.4%
1 | 211 | 13.1%
4 | 181 | 11.3%
8 | 166 | 10.3%
5 | 137 | 8.5%
3 | 112 | 7.0%
2 | 97 | 6.0%
7 | 70 | 4.4%
6 | 64 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1607 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 268 | 16.7%
0 | 248 | 15.4%
1 | 211 | 13.1%
4 | 181 | 11.3%
8 | 166 | 10.3%
5 | 137 | 8.5%
3 | 112 | 7.0%
2 | 97 | 6.0%
7 | 70 | 4.4%
6 | 64 | 4.0%

데이터기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
Minimum: 2018-02-28 00:00:00
Maximum: 2018-02-28 00:00:00

[Figure: histogram with fixed size bins (bins=1)]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 산업단지명 | 종업원수
산업단지명 | 1.000 | 0.000
종업원수 | 0.000 | 1.000

Missing values

[Figure: nullity bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

# | 산업단지명 | 기업명 | 설립일자 | 종업원수 | 전화번호 | 데이터기준일
0 | 공주검상농공단지 | (주) 피엔제이 생활건강 | 2013-01-24 | 42 | 041-853-1995 | 2018-02-28
1 | 공주검상농공단지 | (주)고려헬스팜 | 2006-08-03 | 18 | 041-852-1041 | 2018-02-28
2 | 공주검상농공단지 | (주)수안산업 | 2000-02-21 | 15 | 041-855-1727 | 2018-02-28
3 | 공주검상농공단지 | (주)신일팜글라스 | 1995-07-20 | 57 | 041-852-9037 | 2018-02-28
4 | 공주검상농공단지 | (주)에스피 | 2004-05-11 | 8 | 041-854-8593 | 2018-02-28
5 | 공주검상농공단지 | (주)에치케이피 | 2004-01-08 | 8 | 041-853-0133 | 2018-02-28
6 | 공주검상농공단지 | 대성산업가스(주) | 2011-06-29 | 8 | 02-721-0827 | 2018-02-28
7 | 공주검상농공단지 | 대주이엔티(주) | 1998-12-04 | 32 | 041-852-9230 | 2018-02-28
8 | 공주검상농공단지 | 대주중공업(주) | 2001-03-28 | 52 | 041-854-0101 | 2018-02-28
9 | 공주검상농공단지 | 솔브레인(주) 제1공장 | 1997-01-10 | 129 | 041-852-1636 | 2018-02-28

Last rows

# | 산업단지명 | 기업명 | 설립일자 | 종업원수 | 전화번호 | 데이터기준일
132 | 공주탄천일반산업단지 | 네이처런스(주) | 2014-06-11 | 14 | 041-852-9778 | 2018-02-28
133 | 공주탄천일반산업단지 | 농업회사법인 주식회사 국일에프앤비 | 2014-12-04 | 5 | 041-853-3977 | 2018-02-28
134 | 공주탄천일반산업단지 | 다이앤텍 | 2013-02-05 | 10 | 031-449-5371 | 2018-02-28
135 | 공주탄천일반산업단지 | 미원화학(주) 탄천공장 | 2017-03-20 | 10 | 041-858-8003 | 2018-02-28
136 | 공주탄천일반산업단지 | 삼화페인트공업(주) | 2014-04-02 | 120 | 041-855-8542 | 2018-02-28
137 | 공주탄천일반산업단지 | 엔피케미칼 | 2015-10-29 | 12 | 031-431-0325 | 2018-02-28
138 | 공주탄천일반산업단지 | 오메가테크놀로지(주) | 2015-10-26 | 24 | 041-858-5920 | 2018-02-28
139 | 공주탄천일반산업단지 | 오씨아이스페셜티주식회사 | 2011-12-22 | 200 | 041-580-0112 | 2018-02-28
140 | 공주탄천일반산업단지 | 정의산업(주) | 2014-07-18 | 10 | 031-352-8537 | 2018-02-28
141 | 공주탄천일반산업단지 | 주식회사 메디켐 | 2013-07-08 | 17 | 031-494-2884 | 2018-02-28