Overview

Dataset statistics

Number of variables5
Number of observations193
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.7 KiB
Average record size in memory40.7 B

Variable types

Text4
Categorical1

Dataset

Description전라남도 순천시에 있는 대기배출시설에 대한 사업장명, 소재지, 도로명소재지, 대표업종, 종 등을 제공하는 데이터 입니다.
URLhttps://www.data.go.kr/data/15117385/fileData.do

Reproduction

Analysis started2023-12-12 07:40:30.059613
Analysis finished2023-12-12 07:40:30.619610
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct191
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:40:30.835771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length8.3937824
Min length3

Characters and Unicode

Total characters1620
Distinct characters244
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique189 ?
Unique (%)97.9%

Sample

1st row(주)한국카라겐
2nd row매일식품(주)
3rd row호성산업(주)
4th row광명목욕탕
5th row아성탕
ValueCountFrequency (%)
주식회사 9
 
3.9%
3
 
1.3%
뉴코아순천점 2
 
0.9%
전진환경(주 2
 
0.9%
순천지점 2
 
0.9%
콘크리트 2
 
0.9%
순천시 2
 
0.9%
백진환경(자 2
 
0.9%
순천브레이크 1
 
0.4%
순천대학교 1
 
0.4%
Other values (207) 207
88.8%
2023-12-12T16:40:31.320411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
6.2%
( 94
 
5.8%
) 94
 
5.8%
64
 
4.0%
54
 
3.3%
50
 
3.1%
48
 
3.0%
43
 
2.7%
41
 
2.5%
40
 
2.5%
Other values (234) 991
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1379
85.1%
Open Punctuation 94
 
5.8%
Close Punctuation 94
 
5.8%
Space Separator 40
 
2.5%
Decimal Number 6
 
0.4%
Uppercase Letter 6
 
0.4%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
7.3%
64
 
4.6%
54
 
3.9%
50
 
3.6%
48
 
3.5%
43
 
3.1%
41
 
3.0%
38
 
2.8%
37
 
2.7%
26
 
1.9%
Other values (222) 877
63.6%
Uppercase Letter
ValueCountFrequency (%)
C 1
16.7%
N 1
16.7%
E 1
16.7%
R 1
16.7%
S 1
16.7%
D 1
16.7%
Decimal Number
ValueCountFrequency (%)
1 4
66.7%
2 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 94
100.0%
Space Separator
ValueCountFrequency (%)
40
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1380
85.2%
Common 234
 
14.4%
Latin 6
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
7.3%
64
 
4.6%
54
 
3.9%
50
 
3.6%
48
 
3.5%
43
 
3.1%
41
 
3.0%
38
 
2.8%
37
 
2.7%
26
 
1.9%
Other values (223) 878
63.6%
Latin
ValueCountFrequency (%)
C 1
16.7%
N 1
16.7%
E 1
16.7%
R 1
16.7%
S 1
16.7%
D 1
16.7%
Common
ValueCountFrequency (%)
( 94
40.2%
) 94
40.2%
40
17.1%
1 4
 
1.7%
2 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1379
85.1%
ASCII 240
 
14.8%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
101
 
7.3%
64
 
4.6%
54
 
3.9%
50
 
3.6%
48
 
3.5%
43
 
3.1%
41
 
3.0%
38
 
2.8%
37
 
2.7%
26
 
1.9%
Other values (222) 877
63.6%
ASCII
ValueCountFrequency (%)
( 94
39.2%
) 94
39.2%
40
16.7%
1 4
 
1.7%
2 2
 
0.8%
C 1
 
0.4%
N 1
 
0.4%
E 1
 
0.4%
R 1
 
0.4%
S 1
 
0.4%
None
ValueCountFrequency (%)
1
100.0%
Distinct181
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:40:31.688936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length32
Mean length21.440415
Min length1

Characters and Unicode

Total characters4138
Distinct characters167
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique179 ?
Unique (%)92.7%

Sample

1st row전라남도 순천시 서면 선평리 253
2nd row전라남도 순천시 서면 선평리 292
3rd row전라남도 순천시 서면 압곡리 769-10
4th row전라남도 순천시 매곡동 124-12
5th row전라남도 순천시 조곡동 152-34
ValueCountFrequency (%)
전라남도 181
20.0%
순천시 181
20.0%
서면 48
 
5.3%
해룡면 27
 
3.0%
별량면 23
 
2.5%
압곡리 18
 
2.0%
조례동 16
 
1.8%
주암면 13
 
1.4%
구상리 12
 
1.3%
금치리 11
 
1.2%
Other values (293) 375
41.4%
2023-12-12T16:40:32.213091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
923
22.3%
192
 
4.6%
192
 
4.6%
190
 
4.6%
189
 
4.6%
182
 
4.4%
181
 
4.4%
181
 
4.4%
1 139
 
3.4%
- 124
 
3.0%
Other values (157) 1645
39.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2359
57.0%
Space Separator 923
 
22.3%
Decimal Number 714
 
17.3%
Dash Punctuation 124
 
3.0%
Open Punctuation 8
 
0.2%
Close Punctuation 8
 
0.2%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
 
8.1%
192
 
8.1%
190
 
8.1%
189
 
8.0%
182
 
7.7%
181
 
7.7%
181
 
7.7%
119
 
5.0%
117
 
5.0%
73
 
3.1%
Other values (141) 743
31.5%
Decimal Number
ValueCountFrequency (%)
1 139
19.5%
2 102
14.3%
4 72
10.1%
3 71
9.9%
8 68
9.5%
5 65
9.1%
7 59
8.3%
9 55
 
7.7%
6 43
 
6.0%
0 40
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
N 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
923
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2359
57.0%
Common 1777
42.9%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
 
8.1%
192
 
8.1%
190
 
8.1%
189
 
8.0%
182
 
7.7%
181
 
7.7%
181
 
7.7%
119
 
5.0%
117
 
5.0%
73
 
3.1%
Other values (141) 743
31.5%
Common
ValueCountFrequency (%)
923
51.9%
1 139
 
7.8%
- 124
 
7.0%
2 102
 
5.7%
4 72
 
4.1%
3 71
 
4.0%
8 68
 
3.8%
5 65
 
3.7%
7 59
 
3.3%
9 55
 
3.1%
Other values (4) 99
 
5.6%
Latin
ValueCountFrequency (%)
N 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2359
57.0%
ASCII 1779
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
923
51.9%
1 139
 
7.8%
- 124
 
7.0%
2 102
 
5.7%
4 72
 
4.0%
3 71
 
4.0%
8 68
 
3.8%
5 65
 
3.7%
7 59
 
3.3%
9 55
 
3.1%
Other values (6) 101
 
5.7%
Hangul
ValueCountFrequency (%)
192
 
8.1%
192
 
8.1%
190
 
8.1%
189
 
8.0%
182
 
7.7%
181
 
7.7%
181
 
7.7%
119
 
5.0%
117
 
5.0%
73
 
3.1%
Other values (141) 743
31.5%
Distinct173
Distinct (%)89.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:40:32.570206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length21.186528
Min length1

Characters and Unicode

Total characters4089
Distinct characters204
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)86.5%

Sample

1st row전라남도 순천시 서면 산단4길 90
2nd row전라남도 순천시 서면 산단1길 16
3rd row전라남도 순천시 서면 산단4길 56
4th row전라남도 순천시 중앙로 138-1 (매곡동)
5th row전라남도 순천시 역전길 33 (조곡동)
ValueCountFrequency (%)
전라남도 177
19.0%
순천시 177
19.0%
서면 46
 
4.9%
해룡면 24
 
2.6%
별량면 21
 
2.2%
조례동 17
 
1.8%
녹색로 14
 
1.5%
주암면 11
 
1.2%
주석로 9
 
1.0%
산단4길 8
 
0.9%
Other values (289) 430
46.0%
2023-12-12T16:40:33.185266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
809
19.8%
209
 
5.1%
203
 
5.0%
186
 
4.5%
179
 
4.4%
179
 
4.4%
178
 
4.4%
177
 
4.3%
1 115
 
2.8%
106
 
2.6%
Other values (194) 1748
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2555
62.5%
Space Separator 809
 
19.8%
Decimal Number 530
 
13.0%
Open Punctuation 80
 
2.0%
Close Punctuation 80
 
2.0%
Dash Punctuation 32
 
0.8%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
209
 
8.2%
203
 
7.9%
186
 
7.3%
179
 
7.0%
179
 
7.0%
178
 
7.0%
177
 
6.9%
106
 
4.1%
103
 
4.0%
82
 
3.2%
Other values (177) 953
37.3%
Decimal Number
ValueCountFrequency (%)
1 115
21.7%
2 80
15.1%
3 72
13.6%
4 51
9.6%
0 42
 
7.9%
5 41
 
7.7%
7 39
 
7.4%
9 34
 
6.4%
6 32
 
6.0%
8 24
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
B 1
33.3%
T 1
33.3%
L 1
33.3%
Space Separator
ValueCountFrequency (%)
809
100.0%
Open Punctuation
ValueCountFrequency (%)
( 80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 80
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2555
62.5%
Common 1531
37.4%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
209
 
8.2%
203
 
7.9%
186
 
7.3%
179
 
7.0%
179
 
7.0%
178
 
7.0%
177
 
6.9%
106
 
4.1%
103
 
4.0%
82
 
3.2%
Other values (177) 953
37.3%
Common
ValueCountFrequency (%)
809
52.8%
1 115
 
7.5%
( 80
 
5.2%
) 80
 
5.2%
2 80
 
5.2%
3 72
 
4.7%
4 51
 
3.3%
0 42
 
2.7%
5 41
 
2.7%
7 39
 
2.5%
Other values (4) 122
 
8.0%
Latin
ValueCountFrequency (%)
B 1
33.3%
T 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2555
62.5%
ASCII 1534
37.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
809
52.7%
1 115
 
7.5%
( 80
 
5.2%
) 80
 
5.2%
2 80
 
5.2%
3 72
 
4.7%
4 51
 
3.3%
0 42
 
2.7%
5 41
 
2.7%
7 39
 
2.5%
Other values (7) 125
 
8.1%
Hangul
ValueCountFrequency (%)
209
 
8.2%
203
 
7.9%
186
 
7.3%
179
 
7.0%
179
 
7.0%
178
 
7.0%
177
 
6.9%
106
 
4.1%
103
 
4.0%
82
 
3.2%
Other values (177) 953
37.3%
Distinct75
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:40:33.529511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length8.0673575
Min length1

Characters and Unicode

Total characters1557
Distinct characters133
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)25.4%

Sample

1st row
2nd row장류 제조업
3rd row레미콘 제조업
4th row
5th row
ValueCountFrequency (%)
자동차 44
 
10.2%
제조업 42
 
9.8%
수리업 40
 
9.3%
종합 37
 
8.6%
22
 
5.1%
기타 20
 
4.7%
처리업 17
 
4.0%
폐기물 16
 
3.7%
전문 5
 
1.2%
비금속광물제품 4
 
0.9%
Other values (115) 183
42.6%
2023-12-12T16:40:33.959220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
343
22.0%
139
 
8.9%
65
 
4.2%
64
 
4.1%
53
 
3.4%
48
 
3.1%
47
 
3.0%
46
 
3.0%
44
 
2.8%
40
 
2.6%
Other values (123) 668
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1214
78.0%
Space Separator 343
 
22.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
 
11.4%
65
 
5.4%
64
 
5.3%
53
 
4.4%
48
 
4.0%
47
 
3.9%
46
 
3.8%
44
 
3.6%
40
 
3.3%
39
 
3.2%
Other values (122) 629
51.8%
Space Separator
ValueCountFrequency (%)
343
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1214
78.0%
Common 343
 
22.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
 
11.4%
65
 
5.4%
64
 
5.3%
53
 
4.4%
48
 
4.0%
47
 
3.9%
46
 
3.8%
44
 
3.6%
40
 
3.3%
39
 
3.2%
Other values (122) 629
51.8%
Common
ValueCountFrequency (%)
343
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1214
78.0%
ASCII 343
 
22.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
343
100.0%
Hangul
ValueCountFrequency (%)
139
 
11.4%
65
 
5.4%
64
 
5.3%
53
 
4.4%
48
 
4.0%
47
 
3.9%
46
 
3.8%
44
 
3.6%
40
 
3.3%
39
 
3.2%
Other values (122) 629
51.8%


Categorical

Distinct3
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
5종
123 
4종
66 
3종
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5종
2nd row5종
3rd row4종
4th row5종
5th row5종

Common Values

ValueCountFrequency (%)
5종 123
63.7%
4종 66
34.2%
3종 4
 
2.1%

Length

2023-12-12T16:40:34.067281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:40:34.143703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 123
63.7%
4종 66
34.2%
3종 4
 
2.1%

Correlations

2023-12-12T16:40:34.196396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표업종
대표업종1.0000.716
0.7161.000

Missing values

2023-12-12T16:40:30.484147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:40:30.581043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명소재지도로명소재지대표업종
0(주)한국카라겐전라남도 순천시 서면 선평리 253전라남도 순천시 서면 산단4길 905종
1매일식품(주)전라남도 순천시 서면 선평리 292전라남도 순천시 서면 산단1길 16장류 제조업5종
2호성산업(주)전라남도 순천시 서면 압곡리 769-10전라남도 순천시 서면 산단4길 56레미콘 제조업4종
3광명목욕탕전라남도 순천시 매곡동 124-12전라남도 순천시 중앙로 138-1 (매곡동)5종
4아성탕전라남도 순천시 조곡동 152-34전라남도 순천시 역전길 33 (조곡동)5종
5옥천탕전라남도 순천시 저전동 143-5전라남도 순천시 서문로 16 (저전동)5종
6순천탕전라남도 순천시 동외동 225-6전라남도 순천시 북문길 90 (동외동)5종
7금호탕전라남도 순천시 매곡동 472-7전라남도 순천시 북문길 202 (매곡동)5종
8대흥정미소전라남도 순천시 서면 선평리 1035 대흥정미소전라남도 순천시 서면 선평길 40 대흥정미소4종
9한국신광마이크로애렉트로닉스(주)전라남도 순천시 서면 선평리 35전라남도 순천시 서면 산단1길 32기타 전자부품 제조업4종
사업장명소재지도로명소재지대표업종
183(주)삼일산업건영전라남도 순천시 황전면 죽내리 산 3-5비금속광물 분쇄물 생산업4종
184금령주식회사전라남도 순천시 해룡면 호두리 668전라남도 순천시 해룡면 해룡산단6로 95-15건설용 쇄석 생산업4종
185전진환경(주) 1공장전라남도 순천시 연향동 644-5 전진환경사무실 작업장전라남도 순천시 명말1길 79 전진환경사무실 작업장 (연향동)지정외 폐기물 처리업5종
186전진환경(주) 2공장전라남도 순천시 인월동 54-57전라남도 순천시 녹색로 1230 1238동 (인월동)지정외 폐기물 처리업5종
187체육시설운영과(신대 유청소년수영장)전라남도 순천시 해룡면 신대리 1980전라남도 순천시 해룡면 매안로 138기타 스포츠시설 운영업5종
188(주)정인산업전라남도 순천시 서면 압곡리 1036-2전라남도 순천시 서면 산단2길 85가공 및 재생 플라스틱원료 생산업4종
189(주)모다이노칩전라남도 순천시 해룡면 남가리 99 순천만플라자 모다아울렛전라남도 순천시 해룡면 상성길 70 순천만플라자 모다아울렛상품 종합 도매업5종
190태양환경(주)전라남도 순천시 서면 압곡리 40전라남도 순천시 서면 구랑실재길 35-10지정외 폐기물 처리업4종
191(주)대방레미콘전라남도 순천시 별량면 봉림리 323 326-2 324-6레미콘 제조업5종
192(주)줌톤전라남도 순천시 서면 압곡리 1007전라남도 순천시 서면 산단4길 12기타 석제품 제조업5종