Overview

Dataset statistics

Number of variables: 7
Number of observations: 272
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 1
Duplicate rows (%): 0.4%
Total size in memory: 15.0 KiB
Average record size in memory: 56.5 B
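
The percentages and the average record size above can be re-derived from the raw counts. The following is a minimal sketch, assuming that missing cells are measured against all cells (rows times columns), that duplicate rows are measured against the row count, and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 272
missing_cells = 1
duplicate_rows = 1
total_size_kib = 15.0

# Assumption: missing cells are counted against every cell (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Assumption: duplicate rows are counted against the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average record size: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the rounded results agree with the reported 0.1%, 0.4% and 56.5 B.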

Variable types

Categorical: 2
Text: 4
DateTime: 1

Dataset

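Description: This dataset lists business sites that have installed and reported air pollutant emission facilities, providing the business name, representative, address, industry type, and other details.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=347&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080575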

Alerts

업무구분 has constant value "신고" (Constant)
데이터기준일자 has constant value "2023-03-22" (Constant)
Dataset has 1 (0.4%) duplicate rows (Duplicates)
The unnamed categorical variable is highly imbalanced (54.5%) (Imbalance)

Reproduction

Analysis started: 2024-01-09 22:24:33.897887
Analysis finished: 2024-01-09 22:24:34.687692
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
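
The reported duration can be checked against the two timestamps. This is a small sketch using only the standard library; the format string is an assumption about how the timestamps are written, based on the values shown above.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values in the Reproduction section.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-01-09 22:24:33.897887", FMT)
finished = datetime.strptime("2024-01-09 22:24:34.687692", FMT)

# Elapsed wall-clock time between the start and the end of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```

The difference rounds to the 0.79 seconds stated in the report.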

Variables

업무구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
신고: 272

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신고
2nd row: 신고
3rd row: 신고
4th row: 신고
5th row: 신고

Common Values

Value | Count | Frequency (%)
신고 | 272 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values. 신고: 272 (100.0%)]

사업장명
Text

Distinct: 265
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 18
Median length: 14
Mean length: 7.8897059
Min length: 2

Characters and Unicode

Total characters: 2146
Distinct characters: 282
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
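
As an illustration of the character properties mentioned above, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how the report computes its figures. Scripts and blocks are not exposed by `unicodedata`, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# A business name taken from the sample rows of this report.
counts = category_counts("삼남제약(주)")
print(counts)

# The single code point U+3231, which appears in names such as 케이로텍㈜.
symbol_category = unicodedata.category("㈜")
print(symbol_category)
```

In the two-letter codes, `Lo` corresponds to Other Letter, `Ps` to Open Punctuation, `Pe` to Close Punctuation and `So` to Other Symbol, which are the category labels used in the tables of this report.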

Unique

Unique: 258
Unique (%): 94.9%
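
The difference between "Distinct" and "Unique" can be sketched as follows, assuming that "Distinct" counts the different values in a column and "Unique" counts the values that occur exactly once. The function name and the toy list are illustrative only and are not data from the report.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique


# Toy input: one name repeated, two names occurring once.
names = ["삼남제약(주)", "(주)광성화학", "경기광업(주)", "(주)광성화학"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)
```

Under that reading the reported figures are internally consistent: 265 - 258 = 7 values occur more than once, and they account for the remaining 272 - 258 = 14 rows.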

Sample

1st row: 삼남제약(주)
2nd row: (주)광성화학
3rd row: 경기광업(주)
4th row: 중앙목욕탕
5th row: 광흥제면

Value | Count | Frequency (%)
주식회사 | 10 | 3.3%
금산공장 | 3 | 1.0%
농업회사법인 | 3 | 1.0%
주)태미분체 | 2 | 0.7%
주)광성화학 | 2 | 0.7%
주)이에스에프씨티 | 2 | 0.7%
제2공장 | 2 | 0.7%
대호물산(주 | 2 | 0.7%
주)미라이후손관거 | 2 | 0.7%
주)동신화학 | 2 | 0.7%
Other values (267) | 270 | 90.0%

Most occurring characters

ValueCountFrequency (%)
195
 
9.1%
( 182
 
8.5%
) 182
 
8.5%
85
 
4.0%
70
 
3.3%
46
 
2.1%
37
 
1.7%
36
 
1.7%
35
 
1.6%
33
 
1.5%
Other values (272) 1245
58.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1725 | 80.4%
Open Punctuation | 183 | 8.5%
Close Punctuation | 183 | 8.5%
Space Separator | 28 | 1.3%
Decimal Number | 13 | 0.6%
Uppercase Letter | 7 | 0.3%
Other Symbol | 5 | 0.2%
Other Punctuation | 1 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
195
 
11.3%
85
 
4.9%
70
 
4.1%
46
 
2.7%
37
 
2.1%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.9%
28
 
1.6%
Other values (252) 1128
65.4%
Uppercase Letter
ValueCountFrequency (%)
X 1
14.3%
T 1
14.3%
B 1
14.3%
M 1
14.3%
S 1
14.3%
G 1
14.3%
E 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 6
46.2%
8 2
 
15.4%
9 2
 
15.4%
3 2
 
15.4%
1 1
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 182
99.5%
[ 1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 182
99.5%
] 1
 
0.5%
Space Separator
ValueCountFrequency (%)
28
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1730
80.6%
Common 409
 
19.1%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
195
 
11.3%
85
 
4.9%
70
 
4.0%
46
 
2.7%
37
 
2.1%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.8%
28
 
1.6%
Other values (253) 1133
65.5%
Common
ValueCountFrequency (%)
( 182
44.5%
) 182
44.5%
28
 
6.8%
2 6
 
1.5%
8 2
 
0.5%
9 2
 
0.5%
3 2
 
0.5%
/ 1
 
0.2%
] 1
 
0.2%
[ 1
 
0.2%
Other values (2) 2
 
0.5%
Latin
ValueCountFrequency (%)
X 1
14.3%
T 1
14.3%
B 1
14.3%
M 1
14.3%
S 1
14.3%
G 1
14.3%
E 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1725
80.4%
ASCII 416
 
19.4%
None 5
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
195
 
11.3%
85
 
4.9%
70
 
4.1%
46
 
2.7%
37
 
2.1%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.9%
28
 
1.6%
Other values (252) 1128
65.4%
ASCII
ValueCountFrequency (%)
( 182
43.8%
) 182
43.8%
28
 
6.7%
2 6
 
1.4%
8 2
 
0.5%
9 2
 
0.5%
3 2
 
0.5%
X 1
 
0.2%
/ 1
 
0.2%
] 1
 
0.2%
Other values (9) 9
 
2.2%
None
ValueCountFrequency (%)
5
100.0%

대표자
Text

Distinct: 210
Distinct (%): 77.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 10
Median length: 3
Mean length: 3.3419118
Min length: 3

Characters and Unicode

Total characters: 909
Distinct characters: 152
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 197
Unique (%): 72.4%

Sample

1st row: 대표이사
2nd row: 김광래
3rd row: 권희문
4th row: 김민수
5th row: 최광식

Value | Count | Frequency (%)
대표이사 | 50 | 18.0%
장호윤 | 3 | 1.1%
김정림 | 2 | 0.7%
유병택 | 2 | 0.7%
조합장 | 2 | 0.7%
노민성 | 2 | 0.7%
유인식 | 2 | 0.7%
김용기 | 2 | 0.7%
임종득 | 2 | 0.7%
이상규 | 2 | 0.7%
Other values (205) | 209 | 75.2%

Most occurring characters

ValueCountFrequency (%)
87
 
9.6%
53
 
5.8%
51
 
5.6%
51
 
5.6%
33
 
3.6%
27
 
3.0%
16
 
1.8%
15
 
1.7%
15
 
1.7%
15
 
1.7%
Other values (142) 546
60.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 895 | 98.5%
Space Separator | 9 | 1.0%
Open Punctuation | 2 | 0.2%
Close Punctuation | 2 | 0.2%
Decimal Number | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
9.7%
53
 
5.9%
51
 
5.7%
51
 
5.7%
33
 
3.7%
27
 
3.0%
16
 
1.8%
15
 
1.7%
15
 
1.7%
15
 
1.7%
Other values (138) 532
59.4%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 895
98.5%
Common 14
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
9.7%
53
 
5.9%
51
 
5.7%
51
 
5.7%
33
 
3.7%
27
 
3.0%
16
 
1.8%
15
 
1.7%
15
 
1.7%
15
 
1.7%
Other values (138) 532
59.4%
Common
ValueCountFrequency (%)
9
64.3%
( 2
 
14.3%
) 2
 
14.3%
1 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 895
98.5%
ASCII 14
 
1.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
87
 
9.7%
53
 
5.9%
51
 
5.7%
51
 
5.7%
33
 
3.7%
27
 
3.0%
16
 
1.8%
15
 
1.7%
15
 
1.7%
15
 
1.7%
Other values (138) 532
59.4%
ASCII
ValueCountFrequency (%)
9
64.3%
( 2
 
14.3%
) 2
 
14.3%
1 1
 
7.1%

주소
Text

Distinct: 249
Distinct (%): 91.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 39
Median length: 36
Mean length: 22.911765
Min length: 17

Characters and Unicode

Total characters: 6232
Distinct characters: 153
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 233
Unique (%): 85.7%

Sample

1st row: 충청남도 금산군 금산읍 상리 99-1
2nd row: 충청남도 금산군 추부면 마전리 176-2
3rd row: 충청남도 금산군 진산면 삼가리 336-1
4th row: 충청남도 금산군 금산읍 중도리 506
5th row: 충청남도 금산군 추부면 장대리 577

Value | Count | Frequency (%)
충청남도 | 272 | 19.2%
금산군 | 272 | 19.2%
추부면 | 100 | 7.1%
복수면 | 65 | 4.6%
금성면 | 32 | 2.3%
용진리 | 26 | 1.8%
진산면 | 25 | 1.8%
군북면 | 19 | 1.3%
금산읍 | 18 | 1.3%
마전리 | 17 | 1.2%
Other values (347) | 572 | 40.3%
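
The word-frequency table for 주소 lists address components such as 충청남도 and 금산군 rather than whole addresses. A minimal sketch of how such counts could be produced is given below, assuming plain whitespace tokenization; the report's actual tokenizer is not shown in this extract and may differ. The input is the five sample rows of the column, not the full dataset.

```python
from collections import Counter

# The five sample addresses shown for the 주소 column.
addresses = [
    "충청남도 금산군 금산읍 상리 99-1",
    "충청남도 금산군 추부면 마전리 176-2",
    "충청남도 금산군 진산면 삼가리 336-1",
    "충청남도 금산군 금산읍 중도리 506",
    "충청남도 금산군 추부면 장대리 577",
]

# Assumption: tokens are obtained by splitting on whitespace.
tokens = Counter(tok for address in addresses for tok in address.split())
total = sum(tokens.values())

# Print each token with its count and its share of all tokens.
for value, count in tokens.most_common():
    print(value, count, f"{100 * count / total:.1f}%")
```

Because every address starts with the same province and county, those two tokens dominate the counts, which mirrors the pattern in the full table.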

Most occurring characters

ValueCountFrequency (%)
1388
22.3%
331
 
5.3%
328
 
5.3%
291
 
4.7%
281
 
4.5%
278
 
4.5%
272
 
4.4%
272
 
4.4%
254
 
4.1%
249
 
4.0%
Other values (143) 2288
36.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3720 | 59.7%
Space Separator | 1388 | 22.3%
Decimal Number | 955 | 15.3%
Dash Punctuation | 146 | 2.3%
Close Punctuation | 9 | 0.1%
Open Punctuation | 9 | 0.1%
Other Symbol | 3 | < 0.1%
Uppercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
 
8.9%
328
 
8.8%
291
 
7.8%
281
 
7.6%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
249
 
6.7%
118
 
3.2%
Other values (127) 1046
28.1%
Decimal Number
ValueCountFrequency (%)
1 198
20.7%
2 105
11.0%
5 97
10.2%
6 97
10.2%
4 86
9.0%
8 83
8.7%
3 78
 
8.2%
9 73
 
7.6%
7 69
 
7.2%
0 69
 
7.2%
Space Separator
ValueCountFrequency (%)
1388
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 146
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3723
59.7%
Common 2507
40.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
331
 
8.9%
328
 
8.8%
291
 
7.8%
281
 
7.5%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
249
 
6.7%
118
 
3.2%
Other values (128) 1049
28.2%
Common
ValueCountFrequency (%)
1388
55.4%
1 198
 
7.9%
- 146
 
5.8%
2 105
 
4.2%
5 97
 
3.9%
6 97
 
3.9%
4 86
 
3.4%
8 83
 
3.3%
3 78
 
3.1%
9 73
 
2.9%
Other values (4) 156
 
6.2%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3720
59.7%
ASCII 2509
40.3%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1388
55.3%
1 198
 
7.9%
- 146
 
5.8%
2 105
 
4.2%
5 97
 
3.9%
6 97
 
3.9%
4 86
 
3.4%
8 83
 
3.3%
3 78
 
3.1%
9 73
 
2.9%
Other values (5) 158
 
6.3%
Hangul
ValueCountFrequency (%)
331
 
8.9%
328
 
8.8%
291
 
7.8%
281
 
7.6%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
249
 
6.7%
118
 
3.2%
Other values (127) 1046
28.1%
None
ValueCountFrequency (%)
3
100.0%

업종
Text

Distinct: 144
Distinct (%): 53.1%
Missing: 1
Missing (%): 0.4%
Memory size: 2.3 KiB

Length

Max length: 25
Median length: 20
Mean length: 10.605166
Min length: 1
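
The mean lengths reported for the four text variables can be cross-checked against their "Total characters" figures. The sketch below assumes that the mean is the total character count divided by the number of non-missing values; the helper `mean_length` is hypothetical, and the column names are taken from the sample table header.

```python
N_OBSERVATIONS = 272


def mean_length(total_characters, n_missing=0, n_observations=N_OBSERVATIONS):
    """Mean string length over the non-missing values of a column."""
    return total_characters / (n_observations - n_missing)


# column: (total characters, missing values, mean length as reported)
reported = {
    "사업장명": (2146, 0, 7.8897059),
    "대표자": (909, 0, 3.3419118),
    "주소": (6232, 0, 22.911765),
    "업종": (2874, 1, 10.605166),
}

computed = {}
for name, (chars, missing, mean) in reported.items():
    computed[name] = mean_length(chars, missing)
    print(name, f"{computed[name]:.7f}", "reported:", mean)
```

For 업종 the divisor is 271 rather than 272, because one value is missing; with that adjustment all four computed means match the reported ones to the displayed precision.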

Characters and Unicode

Total characters: 2874
Distinct characters: 172
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 104
Unique (%): 38.4%

Sample

1st row: 의약품 제조업
2nd row: 기타 비금속광물 광업
3rd row: 비금속광물제조업
4th row: (blank)
5th row: (blank)

Value | Count | Frequency (%)
제조업 | 126 | 17.7%
(value not shown) | 71 | 10.0%
기타 | 44 | 6.2%
처리업 | 12 | 1.7%
플라스틱 | 12 | 1.7%
폐기물 | 11 | 1.5%
자동차 | 10 | 1.4%
인삼식품 | 9 | 1.3%
목재가구 | 8 | 1.1%
고무 | 8 | 1.1%
Other values (208) | 399 | 56.2%

Most occurring characters

ValueCountFrequency (%)
505
 
17.6%
242
 
8.4%
229
 
8.0%
183
 
6.4%
102
 
3.5%
81
 
2.8%
73
 
2.5%
59
 
2.1%
53
 
1.8%
47
 
1.6%
Other values (162) 1300
45.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2359 | 82.1%
Space Separator | 505 | 17.6%
Other Punctuation | 10 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
242
 
10.3%
229
 
9.7%
183
 
7.8%
102
 
4.3%
81
 
3.4%
73
 
3.1%
59
 
2.5%
53
 
2.2%
47
 
2.0%
45
 
1.9%
Other values (159) 1245
52.8%
Other Punctuation
ValueCountFrequency (%)
, 9
90.0%
· 1
 
10.0%
Space Separator
ValueCountFrequency (%)
505
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2359
82.1%
Common 515
 
17.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
242
 
10.3%
229
 
9.7%
183
 
7.8%
102
 
4.3%
81
 
3.4%
73
 
3.1%
59
 
2.5%
53
 
2.2%
47
 
2.0%
45
 
1.9%
Other values (159) 1245
52.8%
Common
ValueCountFrequency (%)
505
98.1%
, 9
 
1.7%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2359
82.1%
ASCII 514
 
17.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
505
98.2%
, 9
 
1.8%
Hangul
ValueCountFrequency (%)
242
 
10.3%
229
 
9.7%
183
 
7.8%
102
 
4.3%
81
 
3.4%
73
 
3.1%
59
 
2.5%
53
 
2.2%
47
 
2.0%
45
 
1.9%
Other values (159) 1245
52.8%
None
ValueCountFrequency (%)
· 1
100.0%


(unnamed variable)
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
5종: 192
4종: 71
3종: 7
2종: 1
(value not shown): 1

Length

Max length: 2
Median length: 2
Mean length: 1.9963235
Min length: 1

Unique

Unique: 2
Unique (%): 0.7%

Sample

1st row: 4종
2nd row: 4종
3rd row: 4종
4th row: 5종
5th row: 5종

Common Values

Value | Count | Frequency (%)
5종 | 192 | 70.6%
4종 | 71 | 26.1%
3종 | 7 | 2.6%
2종 | 1 | 0.4%
(value not shown) | 1 | 0.4%
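
The 54.5% imbalance figure flagged for this variable can be reproduced from the five counts above. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by the maximum possible entropy for that number of classes. The formula is an assumption that happens to match the reported value, not something stated in the report, and `imbalance_score` is a hypothetical helper.

```python
import math


def imbalance_score(counts):
    """1 - H / log(k): 0 for a uniform distribution, near 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        # Convention chosen for this sketch: a single class has no imbalance score.
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)


# Counts for 5종, 4종, 3종, 2종 and the one value not shown in the extract.
counts = [192, 71, 7, 1, 1]
score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

With these counts the score rounds to 54.5%, which agrees with the alert in the overview.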

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values. 5종: 192 (70.8%), 4종: 71 (26.2%), 3종: 7 (2.6%), 2종: 1 (0.4%)]

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2023-03-22 00:00:00
Maximum: 2023-03-22 00:00:00
[Figure: Histogram with fixed size bins (bins=1)]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Row | 업무구분 | 사업장명 | 대표자 | 주소 | 업종 | (unnamed) | 데이터기준일자
0 | 신고 | 삼남제약(주) | 대표이사 | 충청남도 금산군 금산읍 상리 99-1 | 의약품 제조업 | 4종 | 2023-03-22
1 | 신고 | (주)광성화학 | 김광래 | 충청남도 금산군 추부면 마전리 176-2 | 기타 비금속광물 광업 | 4종 | 2023-03-22
2 | 신고 | 경기광업(주) | 권희문 | 충청남도 금산군 진산면 삼가리 336-1 | 비금속광물제조업 | 4종 | 2023-03-22
3 | 신고 | 중앙목욕탕 | 김민수 | 충청남도 금산군 금산읍 중도리 506 | | 5종 | 2023-03-22
4 | 신고 | 광흥제면 | 최광식 | 충청남도 금산군 추부면 장대리 577 | | 5종 | 2023-03-22
5 | 신고 | 주안아스콘(주) | 유인식 | 충청남도 금산군 진산면 막현리 286-1 | 아스콘 제조업 | 3종 | 2023-03-22
6 | 신고 | (주)삼진당 | 박동선 | 충청남도 금산군 금산읍 양지리 16-1 | | 4종 | 2023-03-22
7 | 신고 | (주)EG | 대표이사 | 충청남도 금산군 추부면 신평리 820 | 비금속광물제조업 | 4종 | 2023-03-22
8 | 신고 | 대륙화학공업(주) 금산공장 | 송인혁 | 충청남도 금산군 복수면 용진리 115-5 | 산업용 비경화고무제품 제조업 | 3종 | 2023-03-22
9 | 신고 | (주)금성방적 | 윤용근 | 충청남도 금산군 복수면 용진리 115-7 | | 5종 | 2023-03-22

Row | 업무구분 | 사업장명 | 대표자 | 주소 | 업종 | (unnamed) | 데이터기준일자
262 | 신고 | 금산환경재생산업㈜ | 유병택 | 충청남도 금산군 복수면 곡남리 10 | 건축폐기물 처리업 | 5종 | 2023-03-22
263 | 신고 | 두리산업 | 이재윤 | 충청남도 금산군 추부면 자부리 428 | 주방용 및 음식점용 목재가구 제조업 | 5종 | 2023-03-22
264 | 신고 | (주)대호토건 | 하봉순 | 충청남도 금산군 군북면 보광리 556 | 모래자갈채취업 | 5종 | 2023-03-22
265 | 신고 | 한국타이어TBX 차고치는집 | 김종우 외 1 | 충청남도 금산군 추부면 신평리 713 | 자동차종합수리업 | 5종 | 2023-03-22
266 | 신고 | 케이로텍㈜ | 김창현 | 충청남도 금산군 제원면 명곡리 149 | 플라스틱 선, 봉, 관 및 호스 제조업 | 5종 | 2023-03-22
267 | 신고 | (유)지산토건 | 양미애 | 충청남도 금산군 부리면 창평리 149-12 | 모래자갈채취업 | 5종 | 2023-03-22
268 | 신고 | 성신플라스틱기계공업 | 대표이사 | 충청남도 금산군 추부면 마전리 100-14 | 플라스틱 성형 | 5종 | 2023-03-22
269 | 신고 | 에스케이렌터카 주식회사 | 황일문 | 충청남도 금산군 추부면 추정리 221 외 8필지 | 자동차수리업 | 4종 | 2023-03-22
270 | 신고 | (주)정훈엘앤이 | 구본훈 | 충청남도 금산군 추부면 자부리 106-1 | 폐기물종합재활용업 | 5종 | 2023-03-22
271 | 신고 | (주)신덕개발 | 황승규 | 충청남도 금산군 복수면 구례리 산3-1번지 외 15필지 | 모래자갈채취업 | 5종 | 2023-03-22

Duplicate rows

Most frequently occurring

Row | 업무구분 | 사업장명 | 대표자 | 주소 | 업종 | (unnamed) | 데이터기준일자 | # duplicates
0 | 신고 | (주)미라이후손관거 | 장호윤 | 충청남도 금산군 금성면 두곡리 490-1 | 합성수지 및 플라스틱물질 제조업 | 5종 | 2023-03-22 | 2
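
A duplicate-row table like this one can be produced by counting identical records. The sketch below uses only the standard library and treats each row as a tuple of its seven values; the helper `duplicate_rows` is hypothetical, and the three-row input is a small illustration built from rows shown in this report, not the full dataset.

```python
from collections import Counter


def duplicate_rows(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    freq = Counter(rows)
    return {row: count for row, count in freq.items() if count > 1}


duplicated = (
    "신고", "(주)미라이후손관거", "장호윤",
    "충청남도 금산군 금성면 두곡리 490-1",
    "합성수지 및 플라스틱물질 제조업", "5종", "2023-03-22",
)
other = (
    "신고", "삼남제약(주)", "대표이사",
    "충청남도 금산군 금산읍 상리 99-1",
    "의약품 제조업", "4종", "2023-03-22",
)

# Illustrative input: the duplicated record twice plus one unrelated record.
rows = [duplicated, other, duplicated]
dups = duplicate_rows(rows)
for row, count in dups.items():
    print(row[1], count)
```

A row is counted as a duplicate here only when all seven values match exactly, so records that differ in whitespace or punctuation would not be grouped together.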