Overview

Dataset statistics

Number of variables5
Number of observations288
Missing cells76
Missing cells (%)5.3%
Duplicate rows5
Duplicate rows (%)1.7%
Total size in memory11.4 KiB
Average record size in memory40.5 B

Variable types

Text4
Categorical1

Dataset

Description충청남도 논산시 음식물쓰레기 다량배출사업장 데이터로 상호, 사업장전화번호, 사업장도로명주소, 사업장지번주소, 사업장구분 정보를 제공하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=307&beforeMenuCd=DOM_000000201001001000&publicdatapk=15094387

Alerts

Dataset has 5 (1.7%) duplicate rowsDuplicates
사업장구분 is highly imbalanced (56.0%)Imbalance
사업장전화번호 has 76 (26.4%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:58:20.673041
Analysis finished2024-01-09 20:58:21.317865
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct279
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2024-01-10T05:58:21.476743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length6.6006944
Min length1

Characters and Unicode

Total characters1901
Distinct characters355
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique270 ?
Unique (%)93.8%

Sample

1st row연산자연가든
2nd row유림가든
3rd row대석골쉼터
4th row호암초등학교
5th row대둔산대성식당
ValueCountFrequency (%)
건양대학교 3
 
0.9%
행복한 3
 
0.9%
주)동원홈푸드 3
 
0.9%
벌곡(상)휴게소 2
 
0.6%
주)썬엘 2
 
0.6%
논산공장 2
 
0.6%
식당 2
 
0.6%
의료법인 2
 
0.6%
에버그린 2
 
0.6%
황산항아리보쌈 2
 
0.6%
Other values (306) 314
93.2%
2024-01-10T05:58:21.844492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
 
3.0%
49
 
2.6%
48
 
2.5%
46
 
2.4%
32
 
1.7%
32
 
1.7%
) 32
 
1.7%
( 31
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (345) 1512
79.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1777
93.5%
Space Separator 49
 
2.6%
Close Punctuation 32
 
1.7%
Open Punctuation 31
 
1.6%
Uppercase Letter 8
 
0.4%
Lowercase Letter 2
 
0.1%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
3.2%
48
 
2.7%
46
 
2.6%
32
 
1.8%
32
 
1.8%
31
 
1.7%
31
 
1.7%
28
 
1.6%
27
 
1.5%
26
 
1.5%
Other values (331) 1419
79.9%
Uppercase Letter
ValueCountFrequency (%)
J 1
12.5%
C 1
12.5%
I 1
12.5%
V 1
12.5%
P 1
12.5%
N 1
12.5%
F 1
12.5%
S 1
12.5%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1777
93.5%
Common 114
 
6.0%
Latin 10
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
3.2%
48
 
2.7%
46
 
2.6%
32
 
1.8%
32
 
1.8%
31
 
1.7%
31
 
1.7%
28
 
1.6%
27
 
1.5%
26
 
1.5%
Other values (331) 1419
79.9%
Latin
ValueCountFrequency (%)
c 2
20.0%
J 1
10.0%
C 1
10.0%
I 1
10.0%
V 1
10.0%
P 1
10.0%
N 1
10.0%
F 1
10.0%
S 1
10.0%
Common
ValueCountFrequency (%)
49
43.0%
) 32
28.1%
( 31
27.2%
2 1
 
0.9%
4 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1777
93.5%
ASCII 124
 
6.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
3.2%
48
 
2.7%
46
 
2.6%
32
 
1.8%
32
 
1.8%
31
 
1.7%
31
 
1.7%
28
 
1.6%
27
 
1.5%
26
 
1.5%
Other values (331) 1419
79.9%
ASCII
ValueCountFrequency (%)
49
39.5%
) 32
25.8%
( 31
25.0%
c 2
 
1.6%
J 1
 
0.8%
C 1
 
0.8%
I 1
 
0.8%
V 1
 
0.8%
P 1
 
0.8%
N 1
 
0.8%
Other values (4) 4
 
3.2%

사업장전화번호
Text

MISSING 

Distinct204
Distinct (%)96.2%
Missing76
Missing (%)26.4%
Memory size2.4 KiB
2024-01-10T05:58:22.087362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters2544
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)92.5%

Sample

1st row041-734-2610
2nd row041-741-7520
3rd row041-736-4008
4th row041-732-5310
5th row041-742-0040
ValueCountFrequency (%)
041-742-3344 2
 
0.9%
041-736-0020 2
 
0.9%
041-736-3200 2
 
0.9%
041-730-5287 2
 
0.9%
041-732-2923 2
 
0.9%
041-734-5250 2
 
0.9%
041-734-6888 2
 
0.9%
041-733-0404 2
 
0.9%
041-745-6789 1
 
0.5%
041-732-2332 1
 
0.5%
Other values (194) 194
91.5%
2024-01-10T05:58:22.439001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 424
16.7%
4 374
14.7%
0 365
14.3%
1 312
12.3%
7 293
11.5%
3 262
10.3%
2 141
 
5.5%
5 128
 
5.0%
6 105
 
4.1%
8 80
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2120
83.3%
Dash Punctuation 424
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 374
17.6%
0 365
17.2%
1 312
14.7%
7 293
13.8%
3 262
12.4%
2 141
 
6.7%
5 128
 
6.0%
6 105
 
5.0%
8 80
 
3.8%
9 60
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2544
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 424
16.7%
4 374
14.7%
0 365
14.3%
1 312
12.3%
7 293
11.5%
3 262
10.3%
2 141
 
5.5%
5 128
 
5.0%
6 105
 
4.1%
8 80
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2544
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 424
16.7%
4 374
14.7%
0 365
14.3%
1 312
12.3%
7 293
11.5%
3 262
10.3%
2 141
 
5.5%
5 128
 
5.0%
6 105
 
4.1%
8 80
 
3.1%
Distinct256
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2024-01-10T05:58:22.708300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length34
Mean length22.399306
Min length1

Characters and Unicode

Total characters6451
Distinct characters151
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)81.6%

Sample

1st row충청남도 논산시 연산면 계백로 1992
2nd row충청남도 논산시 가야곡면 탑정로 802
3rd row충청남도 논산시 연산면 계백로 2456-3
4th row충청남도 논산시 노성면 호월로113번길 6
5th row충청남도 논산시 벌곡면 황룡재로586번길 10
ValueCountFrequency (%)
충청남도 277
19.8%
논산시 277
19.8%
취암동 50
 
3.6%
계백로 33
 
2.4%
연무읍 31
 
2.2%
연산면 31
 
2.2%
강경읍 29
 
2.1%
내동 29
 
2.1%
중앙로 18
 
1.3%
가야곡면 16
 
1.1%
Other values (352) 607
43.4%
2024-01-10T05:58:23.104399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1191
18.5%
348
 
5.4%
298
 
4.6%
291
 
4.5%
279
 
4.3%
279
 
4.3%
278
 
4.3%
277
 
4.3%
246
 
3.8%
1 218
 
3.4%
Other values (141) 2746
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3889
60.3%
Space Separator 1191
 
18.5%
Decimal Number 1039
 
16.1%
Close Punctuation 115
 
1.8%
Open Punctuation 115
 
1.8%
Dash Punctuation 93
 
1.4%
Connector Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
348
 
8.9%
298
 
7.7%
291
 
7.5%
279
 
7.2%
279
 
7.2%
278
 
7.1%
277
 
7.1%
246
 
6.3%
136
 
3.5%
106
 
2.7%
Other values (126) 1351
34.7%
Decimal Number
ValueCountFrequency (%)
1 218
21.0%
2 135
13.0%
3 114
11.0%
4 93
9.0%
5 88
8.5%
8 85
 
8.2%
9 83
 
8.0%
0 76
 
7.3%
6 75
 
7.2%
7 72
 
6.9%
Space Separator
ValueCountFrequency (%)
1191
100.0%
Close Punctuation
ValueCountFrequency (%)
) 115
100.0%
Open Punctuation
ValueCountFrequency (%)
( 115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3889
60.3%
Common 2562
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
348
 
8.9%
298
 
7.7%
291
 
7.5%
279
 
7.2%
279
 
7.2%
278
 
7.1%
277
 
7.1%
246
 
6.3%
136
 
3.5%
106
 
2.7%
Other values (126) 1351
34.7%
Common
ValueCountFrequency (%)
1191
46.5%
1 218
 
8.5%
2 135
 
5.3%
) 115
 
4.5%
( 115
 
4.5%
3 114
 
4.4%
4 93
 
3.6%
- 93
 
3.6%
5 88
 
3.4%
8 85
 
3.3%
Other values (5) 315
 
12.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3889
60.3%
ASCII 2562
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1191
46.5%
1 218
 
8.5%
2 135
 
5.3%
) 115
 
4.5%
( 115
 
4.5%
3 114
 
4.4%
4 93
 
3.6%
- 93
 
3.6%
5 88
 
3.4%
8 85
 
3.3%
Other values (5) 315
 
12.3%
Hangul
ValueCountFrequency (%)
348
 
8.9%
298
 
7.7%
291
 
7.5%
279
 
7.2%
279
 
7.2%
278
 
7.1%
277
 
7.1%
246
 
6.3%
136
 
3.5%
106
 
2.7%
Other values (126) 1351
34.7%
Distinct266
Distinct (%)92.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2024-01-10T05:58:23.444593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length33
Mean length21.083333
Min length1

Characters and Unicode

Total characters6072
Distinct characters151
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique245 ?
Unique (%)85.1%

Sample

1st row충청남도 논산시 연산면 한전리 109-7
2nd row충청남도 논산시 가야곡면 종연리 494-19
3rd row충청남도 논산시 연산면 송정리 302-1
4th row충청남도 논산시 노성면 호암리 312
5th row충청남도 논산시 벌곡면 한삼천리 81-2
ValueCountFrequency (%)
충청남도 286
21.5%
논산시 286
21.5%
취암동 54
 
4.1%
내동 33
 
2.5%
연무읍 32
 
2.4%
연산면 30
 
2.3%
강경읍 28
 
2.1%
가야곡면 15
 
1.1%
동산리 14
 
1.1%
황산리 12
 
0.9%
Other values (369) 541
40.6%
2024-01-10T05:58:23.898808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1333
22.0%
377
 
6.2%
292
 
4.8%
291
 
4.8%
290
 
4.8%
288
 
4.7%
287
 
4.7%
287
 
4.7%
1 219
 
3.6%
- 205
 
3.4%
Other values (141) 2203
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3453
56.9%
Space Separator 1333
 
22.0%
Decimal Number 1071
 
17.6%
Dash Punctuation 205
 
3.4%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
377
10.9%
292
 
8.5%
291
 
8.4%
290
 
8.4%
288
 
8.3%
287
 
8.3%
287
 
8.3%
167
 
4.8%
148
 
4.3%
106
 
3.1%
Other values (126) 920
26.6%
Decimal Number
ValueCountFrequency (%)
1 219
20.4%
2 133
12.4%
4 130
12.1%
3 118
11.0%
5 102
9.5%
0 86
 
8.0%
9 81
 
7.6%
7 78
 
7.3%
8 64
 
6.0%
6 60
 
5.6%
Space Separator
ValueCountFrequency (%)
1333
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 205
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3453
56.9%
Common 2619
43.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
377
10.9%
292
 
8.5%
291
 
8.4%
290
 
8.4%
288
 
8.3%
287
 
8.3%
287
 
8.3%
167
 
4.8%
148
 
4.3%
106
 
3.1%
Other values (126) 920
26.6%
Common
ValueCountFrequency (%)
1333
50.9%
1 219
 
8.4%
- 205
 
7.8%
2 133
 
5.1%
4 130
 
5.0%
3 118
 
4.5%
5 102
 
3.9%
0 86
 
3.3%
9 81
 
3.1%
7 78
 
3.0%
Other values (5) 134
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3453
56.9%
ASCII 2619
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1333
50.9%
1 219
 
8.4%
- 205
 
7.8%
2 133
 
5.1%
4 130
 
5.0%
3 118
 
4.5%
5 102
 
3.9%
0 86
 
3.3%
9 81
 
3.1%
7 78
 
3.0%
Other values (5) 134
 
5.1%
Hangul
ValueCountFrequency (%)
377
10.9%
292
 
8.5%
291
 
8.4%
290
 
8.4%
288
 
8.3%
287
 
8.3%
287
 
8.3%
167
 
4.8%
148
 
4.3%
106
 
3.1%
Other values (126) 920
26.6%

사업장구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
일반음식점
192 
집단급식소
92 
대규모점포
 
2
농수산물시장
 
1
휴게음식점
 
1

Length

Max length6
Median length5
Mean length5.0034722
Min length5

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row집단급식소
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 192
66.7%
집단급식소 92
31.9%
대규모점포 2
 
0.7%
농수산물시장 1
 
0.3%
휴게음식점 1
 
0.3%

Length

2024-01-10T05:58:24.025193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:58:24.119181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 192
66.7%
집단급식소 92
31.9%
대규모점포 2
 
0.7%
농수산물시장 1
 
0.3%
휴게음식점 1
 
0.3%

Missing values

2024-01-10T05:58:21.202215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:58:21.284426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호사업장전화번호사업장도로명주소사업장지번주소사업장구분
0연산자연가든041-734-2610충청남도 논산시 연산면 계백로 1992충청남도 논산시 연산면 한전리 109-7일반음식점
1유림가든041-741-7520충청남도 논산시 가야곡면 탑정로 802충청남도 논산시 가야곡면 종연리 494-19일반음식점
2대석골쉼터041-736-4008충청남도 논산시 연산면 계백로 2456-3충청남도 논산시 연산면 송정리 302-1일반음식점
3호암초등학교041-732-5310충청남도 논산시 노성면 호월로113번길 6충청남도 논산시 노성면 호암리 312집단급식소
4대둔산대성식당<NA>충청남도 논산시 벌곡면 황룡재로586번길 10충청남도 논산시 벌곡면 한삼천리 81-2일반음식점
5풍미가든041-742-0040충청남도 논산시 연무읍 득안대로 438충청남도 논산시 연무읍 죽평리 368-45일반음식점
6백마식당041-735-3044충청남도 논산시 중앙로 505-8 (대교동)충청남도 논산시 대교동 154-20일반음식점
7대만원041-736-6648충청남도 논산시 연산면 계백로 2450충청남도 논산시 연산면 송정리 301-1일반음식점
8연무대초등학교041-742-5295충청남도 논산시 연무읍 득안대로 527충청남도 논산시 연무읍 금곡리 40집단급식소
9산수숯불갈비041-735-6660충청남도 논산시 중앙로398번길 8 (취암동)충청남도 논산시 취암동 1052-3일반음식점
상호사업장전화번호사업장도로명주소사업장지번주소사업장구분
278들풀이동갈비<NA>충청남도 논산시 관촉로67번길 13-3 (지산동)충청남도 논산시 지산동 79-1일반음식점
279한국폴리텍대학바이오캠퍼스041-746-7328충청남도 논산시 강경읍 동안로 112-48충청남도 논산시 강경읍 채운리 315집단급식소
280논산여자중학교<NA>충청남도 논산시 중앙로 349-3 (취암동)충청남도 논산시 취암동 503집단급식소
281논산여자중학교<NA>충청남도 논산시 중앙로 349-3 (취암동)충청남도 논산시 취암동 503집단급식소
282의료법인 예향의료재단041-736-7584충청남도 논산시 연산면 한전2길 49-26충청남도 논산시 연산면 한전리 346-7집단급식소
283연무고등학교041-741-5674충청남도 논산시 연무읍 동안로887번길 5_ 연무고등학교_연무여자중학교충청남도 논산시 연무읍 동산리 879 연무고등학교_연무여자중학교집단급식소
284황산항아리보쌈<NA>충청남도 논산시 노성면 논산평야로 1364-1충청남도 논산시 노성면 읍내리 453-4일반음식점
285황산항아리보쌈<NA>충청남도 논산시 노성면 논산평야로 1364-1충청남도 논산시 노성면 읍내리 453-4일반음식점
286코캄구내식당041-740-3874충청남도 논산시 가야곡면 가야공단길 19충청남도 논산시 가야곡면 야촌리 500-1집단급식소
287벌곡휴게소041-732-7694충청남도 논산시 벌곡면 호남고속도로 2465-10충청남도 논산시 벌곡면 신양리 475일반음식점

Duplicate rows

Most frequently occurring

상호사업장전화번호사업장도로명주소사업장지번주소사업장구분# duplicates
0(주)썬엘 벌곡(상)휴게소041-734-5250충청남도 논산시 벌곡면 벌곡로 9-58충청남도 논산시 벌곡면 양산리 124-1대규모점포2
1논산여자중학교<NA>충청남도 논산시 중앙로 349-3 (취암동)충청남도 논산시 취암동 503집단급식소2
2놀뫼시민장례원041-733-0404충청남도 논산시 원댕이길 12 (내동)충청남도 논산시 내동 497일반음식점2
3에버그린 관광호텔041-742-3344충청남도 논산시 연무읍 황화로 369충청남도 논산시 연무읍 황화정리 976일반음식점2
4황산항아리보쌈<NA>충청남도 논산시 노성면 논산평야로 1364-1충청남도 논산시 노성면 읍내리 453-4일반음식점2