Overview

Dataset statistics

Number of variables8
Number of observations264
Missing cells0
Missing cells (%)0.0%
Duplicate rows35
Duplicate rows (%)13.3%
Total size in memory17.1 KiB
Average record size in memory66.5 B

Variable types

Text2
Numeric1
Categorical5

Dataset

Description경기도 파주시 소규모 공장 방지시설 설치 지원사업에 관한 데이터로 공장명, 설치주소, 시설용량, 설치연도, 전화번호, 관리기관 등의 내용을 포함하고 있습니다.
Author경기도 파주시
URLhttps://www.data.go.kr/data/15100908/fileData.do

Alerts

관리기관명 has constant value ""Constant
관리기관 전화번호 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 35 (13.3%) duplicate rowsDuplicates
설치연도 is highly overall correlated with 공장 전화번호High correlation
공장 전화번호 is highly overall correlated with 설치연도High correlation

Reproduction

Analysis started2023-12-12 05:09:06.668053
Analysis finished2023-12-12 05:09:07.381593
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct186
Distinct (%)70.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T14:09:07.567904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length12
Mean length6.1628788
Min length2

Characters and Unicode

Total characters1627
Distinct characters246
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique130 ?
Unique (%)49.2%

Sample

1st row㈜넵스
2nd row퍼시픽그린코리아
3rd row㈜한국절연물산
4th row파주부광모터스㈜
5th row파주부광모터스㈜
ValueCountFrequency (%)
주식회사 9
 
3.2%
지앤아이㈜ 6
 
2.2%
광명분체 5
 
1.8%
㈜송암씨앤씨 4
 
1.4%
㈜브이티지엠피 4
 
1.4%
솔로몬공예 4
 
1.4%
㈜서울금속 4
 
1.4%
㈜핀란디아 4
 
1.4%
㈜서부산업 4
 
1.4%
㈜르네상스환경디자인산업 3
 
1.1%
Other values (182) 232
83.2%
2023-12-12T14:09:08.031902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
 
8.7%
61
 
3.7%
52
 
3.2%
44
 
2.7%
35
 
2.2%
34
 
2.1%
32
 
2.0%
30
 
1.8%
30
 
1.8%
29
 
1.8%
Other values (236) 1138
69.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1410
86.7%
Other Symbol 142
 
8.7%
Space Separator 17
 
1.0%
Lowercase Letter 14
 
0.9%
Uppercase Letter 14
 
0.9%
Decimal Number 12
 
0.7%
Open Punctuation 7
 
0.4%
Close Punctuation 7
 
0.4%
Other Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
4.3%
52
 
3.7%
44
 
3.1%
35
 
2.5%
34
 
2.4%
32
 
2.3%
30
 
2.1%
30
 
2.1%
29
 
2.1%
29
 
2.1%
Other values (212) 1034
73.3%
Uppercase Letter
ValueCountFrequency (%)
K 2
14.3%
L 2
14.3%
A 2
14.3%
T 1
7.1%
M 1
7.1%
S 1
7.1%
D 1
7.1%
J 1
7.1%
C 1
7.1%
N 1
7.1%
Lowercase Letter
ValueCountFrequency (%)
t 4
28.6%
e 2
14.3%
c 2
14.3%
o 2
14.3%
d 2
14.3%
x 2
14.3%
Decimal Number
ValueCountFrequency (%)
1 10
83.3%
2 2
 
16.7%
Other Symbol
ValueCountFrequency (%)
142
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1552
95.4%
Common 47
 
2.9%
Latin 28
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
9.1%
61
 
3.9%
52
 
3.4%
44
 
2.8%
35
 
2.3%
34
 
2.2%
32
 
2.1%
30
 
1.9%
30
 
1.9%
29
 
1.9%
Other values (213) 1063
68.5%
Latin
ValueCountFrequency (%)
t 4
14.3%
K 2
 
7.1%
e 2
 
7.1%
c 2
 
7.1%
o 2
 
7.1%
L 2
 
7.1%
d 2
 
7.1%
x 2
 
7.1%
A 2
 
7.1%
T 1
 
3.6%
Other values (7) 7
25.0%
Common
ValueCountFrequency (%)
17
36.2%
1 10
21.3%
( 7
14.9%
) 7
14.9%
. 4
 
8.5%
2 2
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1410
86.7%
None 142
 
8.7%
ASCII 75
 
4.6%

Most frequent character per block

None
ValueCountFrequency (%)
142
100.0%
Hangul
ValueCountFrequency (%)
61
 
4.3%
52
 
3.7%
44
 
3.1%
35
 
2.5%
34
 
2.4%
32
 
2.3%
30
 
2.1%
30
 
2.1%
29
 
2.1%
29
 
2.1%
Other values (212) 1034
73.3%
ASCII
ValueCountFrequency (%)
17
22.7%
1 10
13.3%
( 7
 
9.3%
) 7
 
9.3%
t 4
 
5.3%
. 4
 
5.3%
K 2
 
2.7%
2 2
 
2.7%
e 2
 
2.7%
c 2
 
2.7%
Other values (13) 18
24.0%
Distinct198
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T14:09:08.429451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length29
Mean length22.200758
Min length14

Characters and Unicode

Total characters5861
Distinct characters162
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)54.9%

Sample

1st row경기도 파주시 월롱면 황소바위길 377(,70-3)
2nd row경기도 파주시 광탄면 장지산로 368번길 45
3rd row경기도 파주시 파주읍 돈유2로 108
4th row경기도 파주시 월릉면 도감로 164
5th row경기도 파주시 월릉면 도감로 164
ValueCountFrequency (%)
파주시 257
19.8%
경기도 223
 
17.2%
광탄면 68
 
5.2%
조리읍 43
 
3.3%
월롱면 33
 
2.5%
탄현면 28
 
2.2%
파주읍 20
 
1.5%
장지산로 10
 
0.8%
120 8
 
0.6%
수레길 8
 
0.6%
Other values (334) 599
46.2%
2023-12-12T14:09:09.131546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1084
18.5%
286
 
4.9%
285
 
4.9%
264
 
4.5%
243
 
4.1%
231
 
3.9%
231
 
3.9%
1 223
 
3.8%
3 169
 
2.9%
155
 
2.6%
Other values (152) 2690
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3345
57.1%
Decimal Number 1176
 
20.1%
Space Separator 1084
 
18.5%
Dash Punctuation 149
 
2.5%
Close Punctuation 35
 
0.6%
Other Punctuation 35
 
0.6%
Open Punctuation 35
 
0.6%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
286
 
8.6%
285
 
8.5%
264
 
7.9%
243
 
7.3%
231
 
6.9%
231
 
6.9%
155
 
4.6%
150
 
4.5%
143
 
4.3%
96
 
2.9%
Other values (135) 1261
37.7%
Decimal Number
ValueCountFrequency (%)
1 223
19.0%
3 169
14.4%
2 125
10.6%
5 120
10.2%
4 116
9.9%
0 94
8.0%
8 93
7.9%
6 91
7.7%
7 74
 
6.3%
9 71
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
1084
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Other Punctuation
ValueCountFrequency (%)
, 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3345
57.1%
Common 2514
42.9%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
286
 
8.6%
285
 
8.5%
264
 
7.9%
243
 
7.3%
231
 
6.9%
231
 
6.9%
155
 
4.6%
150
 
4.5%
143
 
4.3%
96
 
2.9%
Other values (135) 1261
37.7%
Common
ValueCountFrequency (%)
1084
43.1%
1 223
 
8.9%
3 169
 
6.7%
- 149
 
5.9%
2 125
 
5.0%
5 120
 
4.8%
4 116
 
4.6%
0 94
 
3.7%
8 93
 
3.7%
6 91
 
3.6%
Other values (5) 250
 
9.9%
Latin
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3345
57.1%
ASCII 2516
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1084
43.1%
1 223
 
8.9%
3 169
 
6.7%
- 149
 
5.9%
2 125
 
5.0%
5 120
 
4.8%
4 116
 
4.6%
0 94
 
3.7%
8 93
 
3.7%
6 91
 
3.6%
Other values (7) 252
 
10.0%
Hangul
ValueCountFrequency (%)
286
 
8.6%
285
 
8.5%
264
 
7.9%
243
 
7.3%
231
 
6.9%
231
 
6.9%
155
 
4.6%
150
 
4.5%
143
 
4.3%
96
 
2.9%
Other values (135) 1261
37.7%
Distinct50
Distinct (%)18.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean380.04545
Minimum70
Maximum2300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-12T14:09:09.333237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum70
5-th percentile140
Q1200
median300
Q3400
95-th percentile885
Maximum2300
Range2230
Interquartile range (IQR)200

Descriptive statistics

Standard deviation302.04004
Coefficient of variation (CV)0.79474715
Kurtosis15.124221
Mean380.04545
Median Absolute Deviation (MAD)100
Skewness3.432482
Sum100332
Variance91228.188
MonotonicityNot monotonic
2023-12-12T14:09:09.525381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
400 41
15.5%
300 31
 
11.7%
200 23
 
8.7%
380 23
 
8.7%
250 17
 
6.4%
150 14
 
5.3%
500 9
 
3.4%
350 8
 
3.0%
140 8
 
3.0%
230 8
 
3.0%
Other values (40) 82
31.1%
ValueCountFrequency (%)
70 1
 
0.4%
80 3
 
1.1%
90 1
 
0.4%
100 3
 
1.1%
110 1
 
0.4%
120 2
 
0.8%
125 1
 
0.4%
130 1
 
0.4%
140 8
3.0%
145 1
 
0.4%
ValueCountFrequency (%)
2300 1
 
0.4%
2200 1
 
0.4%
1900 1
 
0.4%
1500 4
1.5%
1300 2
0.8%
1200 1
 
0.4%
1000 1
 
0.4%
950 1
 
0.4%
900 2
0.8%
800 2
0.8%

설치연도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2021
195 
2020
65 
2019
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 195
73.9%
2020 65
 
24.6%
2019 4
 
1.5%

Length

2023-12-12T14:09:09.698055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:09.828841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 195
73.9%
2020 65
 
24.6%
2019 4
 
1.5%

공장 전화번호
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)18.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
<NA>
96 
070-4422-6022
34 
031-424-5545
24 
02-850-3100
17 
02-6389-8096
 
9
Other values (45)
84 

Length

Max length13
Median length12
Mean length9.125
Min length4

Unique

Unique31 ?
Unique (%)11.7%

Sample

1st row02-6389-8096
2nd row02-6389-8096
3rd row02-6389-8096
4th row02-6389-8096
5th row02-6389-8096

Common Values

ValueCountFrequency (%)
<NA> 96
36.4%
070-4422-6022 34
 
12.9%
031-424-5545 24
 
9.1%
02-850-3100 17
 
6.4%
02-6389-8096 9
 
3.4%
031-492-1697 8
 
3.0%
032-327-3443 7
 
2.7%
031-351-3315 6
 
2.3%
02-850-3107 6
 
2.3%
031-998-4180 4
 
1.5%
Other values (40) 53
20.1%

Length

2023-12-12T14:09:10.040371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 96
36.4%
070-4422-6022 34
 
12.9%
031-424-5545 24
 
9.1%
02-850-3100 17
 
6.4%
02-6389-8096 9
 
3.4%
031-492-1697 8
 
3.0%
032-327-3443 7
 
2.7%
031-351-3315 6
 
2.3%
02-850-3107 6
 
2.3%
031-998-4180 4
 
1.5%
Other values (40) 53
20.1%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
경기도 파주시 환경보전과
264 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 파주시 환경보전과
2nd row경기도 파주시 환경보전과
3rd row경기도 파주시 환경보전과
4th row경기도 파주시 환경보전과
5th row경기도 파주시 환경보전과

Common Values

ValueCountFrequency (%)
경기도 파주시 환경보전과 264
100.0%

Length

2023-12-12T14:09:10.195441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:10.313344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 264
33.3%
파주시 264
33.3%
환경보전과 264
33.3%

관리기관 전화번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
031-940-8471
264 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row031-940-8471
2nd row031-940-8471
3rd row031-940-8471
4th row031-940-8471
5th row031-940-8471

Common Values

ValueCountFrequency (%)
031-940-8471 264
100.0%

Length

2023-12-12T14:09:10.445946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:10.577562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
031-940-8471 264
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2022-06-08
264 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-06-08
2nd row2022-06-08
3rd row2022-06-08
4th row2022-06-08
5th row2022-06-08

Common Values

ValueCountFrequency (%)
2022-06-08 264
100.0%

Length

2023-12-12T14:09:10.721800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:09:10.828568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-06-08 264
100.0%

Interactions

2023-12-12T14:09:07.053204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:09:10.895531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용량(세제곱미터_분)설치연도공장 전화번호
용량(세제곱미터_분)1.0000.0000.000
설치연도0.0001.0000.914
공장 전화번호0.0000.9141.000
2023-12-12T14:09:11.038426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공장 전화번호설치연도
공장 전화번호1.0000.636
설치연도0.6361.000
2023-12-12T14:09:11.159241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용량(세제곱미터_분)설치연도공장 전화번호
용량(세제곱미터_분)1.0000.0000.000
설치연도0.0001.0000.636
공장 전화번호0.0000.6361.000

Missing values

2023-12-12T14:09:07.176540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:09:07.317380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

소규모공장명설치주소용량(세제곱미터_분)설치연도공장 전화번호관리기관명관리기관 전화번호데이터기준일자
0㈜넵스경기도 파주시 월롱면 황소바위길 377(,70-3)470202002-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
1퍼시픽그린코리아경기도 파주시 광탄면 장지산로 368번길 45400202002-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
2㈜한국절연물산경기도 파주시 파주읍 돈유2로 108200202002-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
3파주부광모터스㈜경기도 파주시 월릉면 도감로 164380202102-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
4파주부광모터스㈜경기도 파주시 월릉면 도감로 164380202102-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
5㈜엘퍼스경기도 파주시 조리읍 뇌조로 178번길 15-30300202002-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
6㈜넵스경기도 파주시 월롱면 황소바위길 377230202002-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
7㈜동신금속경기도 파주시 적성면 적성산단1로 40-17950202102-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
8㈜신일프레임파주시 월롱면 휴암로117번길 45120202102-6389-8096경기도 파주시 환경보전과031-940-84712022-06-08
9광명분체경기도 파주시 탄현면 한록산길 120150202002-850-3100경기도 파주시 환경보전과031-940-84712022-06-08
소규모공장명설치주소용량(세제곱미터_분)설치연도공장 전화번호관리기관명관리기관 전화번호데이터기준일자
254지앤아이㈜경기도 파주시 광탄면 방축리 5-7번지외 4필지(5-8,5-9,5-10,5-13)4002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
255경희공예경기도 파주시 조리읍 능안로 224-202502021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
256경희공예경기도 파주시 조리읍 능안로 224-204002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
257㈜티아이캐스팅경기도 파주시 월롱면 황소바위길 187-152002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
258㈜티아이캐스팅경기도 파주시 월롱면 황소바위길 187-152002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
259로이메경기도 파주시 탄현면 검산로 361번길 25-153202021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
260서원레저㈜경기도 파주시 광탄면 서원길33315002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
261서원레저㈜경기도 파주시 광탄면 서원길33315002021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
262레디모터스경기도 파주시 면산말길 483802021<NA>경기도 파주시 환경보전과031-940-84712022-06-08
263태성산업사경기도 파주시 월롱면 덕은리 223-81502021<NA>경기도 파주시 환경보전과031-940-84712022-06-08

Duplicate rows

Most frequently occurring

소규모공장명설치주소용량(세제곱미터_분)설치연도공장 전화번호관리기관명관리기관 전화번호데이터기준일자# duplicates
16㈜핀란디아경기도 파주시 조리읍 매봉재길 232602021<NA>경기도 파주시 환경보전과031-940-84712022-06-084
2㈜상록수지산업사경기도 파주시 광탄면 명봉산로352번길 35200202102-850-3107경기도 파주시 환경보전과031-940-84712022-06-083
9㈜아스콘플러스파주시 월롱면 누현길 374002020031-945-3381경기도 파주시 환경보전과031-940-84712022-06-083
19광명분체경기도 파주시 탄현면 한록산길 120150202002-850-3100경기도 파주시 환경보전과031-940-84712022-06-083
01급신현대자동차공업사경기도 파주시 명봉산로 364002021031-492-1697경기도 파주시 환경보전과031-940-84712022-06-082
1㈜디자인동아그룹파주시 조리읍 문원길 263-593802021<NA>경기도 파주시 환경보전과031-940-84712022-06-082
3㈜서부산업파주시 조리읍 뇌조로 178번길 338002020031-941-7827경기도 파주시 환경보전과031-940-84712022-06-082
4㈜서부산업파주시 조리읍 뇌조로 178번길 336502021<NA>경기도 파주시 환경보전과031-940-84712022-06-082
5㈜서울금속경기도 파주시 광탄면 수레길 3604502020031-424-5545경기도 파주시 환경보전과031-940-84712022-06-082
6㈜서울금속경기도 파주시 광탄면 수레길 3607002021031-424-5545경기도 파주시 환경보전과031-940-84712022-06-082