Overview

Dataset statistics

Number of variables5
Number of observations141
Missing cells141
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory42.9 B

Variable types

Numeric1
Text2
Categorical1
Unsupported1

Dataset

Description전라남도 고흥군 대기오염배출물질시설사업장에 대한 데이터로 사업장명, 소재지전체주소, 주생산품명 등에 대한 정보를 제공합니다.
Author전라남도 고흥군
URLhttps://www.data.go.kr/data/15090839/fileData.do

Alerts

주생산품명 is highly imbalanced (56.2%)Imbalance
비고 has 141 (100.0%) missing valuesMissing
번호 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 06:17:59.153811
Analysis finished2023-12-12 06:17:59.758369
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct141
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71
Minimum1
Maximum141
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T15:17:59.858001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8
Q136
median71
Q3106
95-th percentile134
Maximum141
Range140
Interquartile range (IQR)70

Descriptive statistics

Standard deviation40.847277
Coefficient of variation (CV)0.57531375
Kurtosis-1.2
Mean71
Median Absolute Deviation (MAD)35
Skewness0
Sum10011
Variance1668.5
MonotonicityStrictly increasing
2023-12-12T15:18:00.072544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
98 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
97 1
 
0.7%
99 1
 
0.7%
90 1
 
0.7%
Other values (131) 131
92.9%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
141 1
0.7%
140 1
0.7%
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
Distinct134
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T15:18:00.371747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length7.3404255
Min length4

Characters and Unicode

Total characters1035
Distinct characters189
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique127 ?
Unique (%)90.1%

Sample

1st row해양영어조합법인 2공장
2nd row신동방정비
3rd row고흥군 환경순환형 가축분뇨공공처리시설
4th row(주)마르드라
5th row고흥정미소
ValueCountFrequency (%)
주)씨바이오 2
 
1.3%
제일frp조선소 2
 
1.3%
소각시설 2
 
1.3%
농어촌폐기물 2
 
1.3%
팔영농업협동조합 2
 
1.3%
영광수산 2
 
1.3%
해양영어조합법인 2
 
1.3%
팔영농협동강지소 2
 
1.3%
제일자동차정비공업사 2
 
1.3%
국립소록도병원 2
 
1.3%
Other values (136) 138
87.3%
2023-12-12T15:18:00.787562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
4.3%
35
 
3.4%
( 32
 
3.1%
32
 
3.1%
) 32
 
3.1%
32
 
3.1%
29
 
2.8%
27
 
2.6%
23
 
2.2%
21
 
2.0%
Other values (179) 727
70.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 930
89.9%
Open Punctuation 32
 
3.1%
Close Punctuation 32
 
3.1%
Uppercase Letter 18
 
1.7%
Space Separator 17
 
1.6%
Decimal Number 3
 
0.3%
Lowercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
4.8%
35
 
3.8%
32
 
3.4%
32
 
3.4%
29
 
3.1%
27
 
2.9%
23
 
2.5%
21
 
2.3%
20
 
2.2%
19
 
2.0%
Other values (166) 647
69.6%
Uppercase Letter
ValueCountFrequency (%)
F 5
27.8%
P 5
27.8%
R 5
27.8%
M 2
 
11.1%
B 1
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
r 1
33.3%
f 1
33.3%
p 1
33.3%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
2 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 930
89.9%
Common 84
 
8.1%
Latin 21
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
4.8%
35
 
3.8%
32
 
3.4%
32
 
3.4%
29
 
3.1%
27
 
2.9%
23
 
2.5%
21
 
2.3%
20
 
2.2%
19
 
2.0%
Other values (166) 647
69.6%
Latin
ValueCountFrequency (%)
F 5
23.8%
P 5
23.8%
R 5
23.8%
M 2
 
9.5%
r 1
 
4.8%
f 1
 
4.8%
p 1
 
4.8%
B 1
 
4.8%
Common
ValueCountFrequency (%)
( 32
38.1%
) 32
38.1%
17
20.2%
1 2
 
2.4%
2 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 930
89.9%
ASCII 105
 
10.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
45
 
4.8%
35
 
3.8%
32
 
3.4%
32
 
3.4%
29
 
3.1%
27
 
2.9%
23
 
2.5%
21
 
2.3%
20
 
2.2%
19
 
2.0%
Other values (166) 647
69.6%
ASCII
ValueCountFrequency (%)
( 32
30.5%
) 32
30.5%
17
16.2%
F 5
 
4.8%
P 5
 
4.8%
R 5
 
4.8%
M 2
 
1.9%
1 2
 
1.9%
r 1
 
1.0%
f 1
 
1.0%
Other values (3) 3
 
2.9%
Distinct136
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T15:18:01.151287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length37
Mean length24.219858
Min length16

Characters and Unicode

Total characters3415
Distinct characters108
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)92.9%

Sample

1st row전라남도 고흥군 금산면 신평리 111번지 외4필지(111-4,111-9,111-10,111-13)
2nd row전라남도 고흥군 고흥읍 행정리 335-1 외 3필지
3rd row전라남도 고흥군 도덕면 신양리 2998번지
4th row전라남도 고흥군 과역면 석봉리 826
5th row전라남도 고흥군 고흥읍 등암리 480
ValueCountFrequency (%)
전라남도 141
19.1%
고흥군 141
19.1%
도양읍 31
 
4.2%
금산면 23
 
3.1%
고흥읍 19
 
2.6%
용정리 13
 
1.8%
봉암리 13
 
1.8%
동강면 12
 
1.6%
풍양면 9
 
1.2%
7
 
0.9%
Other values (218) 331
44.7%
2023-12-12T15:18:01.995941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
599
17.5%
188
 
5.5%
165
 
4.8%
161
 
4.7%
160
 
4.7%
144
 
4.2%
142
 
4.2%
142
 
4.2%
141
 
4.1%
1 138
 
4.0%
Other values (98) 1435
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2105
61.6%
Decimal Number 608
 
17.8%
Space Separator 599
 
17.5%
Dash Punctuation 81
 
2.4%
Other Punctuation 18
 
0.5%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
188
 
8.9%
165
 
7.8%
161
 
7.6%
160
 
7.6%
144
 
6.8%
142
 
6.7%
142
 
6.7%
141
 
6.7%
117
 
5.6%
109
 
5.2%
Other values (83) 636
30.2%
Decimal Number
ValueCountFrequency (%)
1 138
22.7%
2 78
12.8%
3 70
11.5%
8 57
9.4%
6 49
 
8.1%
5 48
 
7.9%
7 43
 
7.1%
0 43
 
7.1%
9 42
 
6.9%
4 40
 
6.6%
Space Separator
ValueCountFrequency (%)
599
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 81
100.0%
Other Punctuation
ValueCountFrequency (%)
, 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2105
61.6%
Common 1310
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
188
 
8.9%
165
 
7.8%
161
 
7.6%
160
 
7.6%
144
 
6.8%
142
 
6.7%
142
 
6.7%
141
 
6.7%
117
 
5.6%
109
 
5.2%
Other values (83) 636
30.2%
Common
ValueCountFrequency (%)
599
45.7%
1 138
 
10.5%
- 81
 
6.2%
2 78
 
6.0%
3 70
 
5.3%
8 57
 
4.4%
6 49
 
3.7%
5 48
 
3.7%
7 43
 
3.3%
0 43
 
3.3%
Other values (5) 104
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2105
61.6%
ASCII 1310
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
599
45.7%
1 138
 
10.5%
- 81
 
6.2%
2 78
 
6.0%
3 70
 
5.3%
8 57
 
4.4%
6 49
 
3.7%
5 48
 
3.7%
7 43
 
3.3%
0 43
 
3.3%
Other values (5) 104
 
7.9%
Hangul
ValueCountFrequency (%)
188
 
8.9%
165
 
7.8%
161
 
7.6%
160
 
7.6%
144
 
6.8%
142
 
6.7%
142
 
6.7%
141
 
6.7%
117
 
5.6%
109
 
5.2%
Other values (83) 636
30.2%

주생산품명
Categorical

IMBALANCE 

Distinct19
Distinct (%)13.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
<NA>
99 
염장미역 등
17 
레미콘
 
3
유기질비료
 
3
레미콘 등
 
3
Other values (14)
16 

Length

Max length10
Median length4
Mean length4.2411348
Min length1

Unique

Unique12 ?
Unique (%)8.5%

Sample

1st row염장미역 등
2nd row<NA>
3rd row<NA>
4th row크릴전조품 등
5th row

Common Values

ValueCountFrequency (%)
<NA> 99
70.2%
염장미역 등 17
 
12.1%
레미콘 3
 
2.1%
유기질비료 3
 
2.1%
레미콘 등 3
 
2.1%
2
 
1.4%
유자차 2
 
1.4%
톱밥 1
 
0.7%
크릴전조품 등 1
 
0.7%
상토비료 1
 
0.7%
Other values (9) 9
 
6.4%

Length

2023-12-12T15:18:02.150762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 99
60.4%
22
 
13.4%
염장미역 18
 
11.0%
레미콘 6
 
3.7%
유기질비료 3
 
1.8%
유자차 3
 
1.8%
2
 
1.2%
미역 1
 
0.6%
농업용기계 1
 
0.6%
양식용부자 1
 
0.6%
Other values (8) 8
 
4.9%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing141
Missing (%)100.0%
Memory size1.4 KiB

Interactions

2023-12-12T15:17:59.434741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:18:02.222615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호주생산품명
번호1.0000.701
주생산품명0.7011.000
2023-12-12T15:18:02.297022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호주생산품명
번호1.0000.308
주생산품명0.3081.000

Missing values

2023-12-12T15:17:59.590926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:17:59.705491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호사업장명소재지전체주소주생산품명비고
01해양영어조합법인 2공장전라남도 고흥군 금산면 신평리 111번지 외4필지(111-4,111-9,111-10,111-13)염장미역 등<NA>
12신동방정비전라남도 고흥군 고흥읍 행정리 335-1 외 3필지<NA><NA>
23고흥군 환경순환형 가축분뇨공공처리시설전라남도 고흥군 도덕면 신양리 2998번지<NA><NA>
34(주)마르드라전라남도 고흥군 과역면 석봉리 826크릴전조품 등<NA>
45고흥정미소전라남도 고흥군 고흥읍 등암리 480<NA>
56제일제재소전라남도 고흥군 도양읍 관리 865-2<NA><NA>
67팔영산 편백 치유의숲 테라피센터전라남도 고흥군 영남면 금사리 산 37-1번지 (테라피 센터)<NA><NA>
78농업회사법인 탑바이오 주식회사전라남도 고흥군 대서면 화산리 747상토비료<NA>
89팔영농업협동조합전라남도 고흥군 과역면 도천리 10번지 외7필지<NA><NA>
910금성영어조합법인전라남도 고흥군 금산면 오천리 516번지염장미역 등<NA>
번호사업장명소재지전체주소주생산품명비고
131132제일수산전라남도 고흥군 도양읍 봉암리 1480번지 외 1<NA><NA>
132133(주)씨바이오전라남도 고흥군 풍양면 풍남리 1214-1번지<NA><NA>
133134거금도해조류영어조합법인전라남도 고흥군 금산면 오천리 138-1번지염장미역 등<NA>
134135천하수산전라남도 고흥군 도화면 구암리 801-1번지<NA><NA>
135136유한회사 섬나라식품전라남도 고흥군 금산면 대흥리 136-1번지<NA><NA>
136137(주)해양FRP조선소전라남도 고흥군 도양읍 용정리 2187번지<NA><NA>
137138정석수산전라남도 고흥군 두원면 용당리 815-7 외 3필지<NA><NA>
138139명인수산전라남도 고흥군 포두면 옥강리 473-9번지 ,17,18,669-2<NA><NA>
139140동은수산전라남도 고흥군 두원면 대전리 605번지<NA><NA>
140141고흥군청(도양 농어촌폐기물 소각시설)전라남도 고흥군 고흥읍 옥하리 200-2 고흥군청<NA><NA>