Overview

Dataset statistics

Number of variables: 6
Number of observations: 332
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.0 KiB
Average record size in memory: 49.4 B
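
The derived figures in this table can be sanity-checked from the raw counts. The sketch below assumes that the average record size is simply the total memory divided by the number of observations; because the total is reported rounded to 16.0 KiB, the result only approximately matches the reported 49.4 B.

```python
# Figures copied from the dataset statistics in this report.
n_variables = 6
n_observations = 332
missing_cells = 0
total_size_kib = 16.0  # rounded value as reported

# Total number of cells in the table (rows x columns).
total_cells = n_variables * n_observations

# Share of missing cells, in percent.
missing_pct = 100 * missing_cells / total_cells

# Assumed definition: total bytes divided by number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(total_cells)
print(f"{missing_pct:.1f}%")
print(f"about {avg_record_bytes:.0f} B per record")
```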

Variable types

Text: 3
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

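Description: List of businesses operating in Chuncheon that generate large volumes of food waste, with business name, lot-number and road-name addresses, establishment type (general restaurant, group meal service facility, etc.), and floor area.
Author: Chuncheon City, Gangwon Province (강원도 춘천시)
URL: https://www.data.go.kr/data/15093751/fileData.do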

Alerts

데이터기준일자 has constant value "" [Constant]
사업장구분 is highly imbalanced (55.8%) [Imbalance]

Reproduction

Analysis started: 2023-12-12 04:31:09.699358
Analysis finished: 2023-12-12 04:31:10.578732
Duration: 0.88 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
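
The reported duration follows directly from the two timestamps. A minimal check, using only the values printed in this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 04:31:09.699358")
finished = datetime.fromisoformat("2023-12-12 04:31:10.578732")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"{duration:.2f} seconds")
```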

Variables

상호 (business name)
Text

Distinct: 331
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 22
Median length: 19
Mean length: 7.3825301
Min length: 1

Characters and Unicode

Total characters: 2451
Distinct characters: 396
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
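
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The input string is one road-name address taken from the sample rows of this report. The standard library exposes categories but not scripts or blocks, so only categories are shown, and this is not necessarily how the profiling tool computes them.

```python
import unicodedata
from collections import Counter

# One road-name address copied from the report's sample rows.
text = "강원도 춘천시 춘천로 71 (효자동_ 디아펠리즈)"

# General category code per character, for example:
# "Lo" = Other Letter, "Zs" = Space Separator, "Nd" = Decimal Number,
# "Ps"/"Pe" = Open/Close Punctuation, "Pc" = Connector Punctuation.
category_counts = Counter(unicodedata.category(ch) for ch in text)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the text variables in this report.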

Unique

Unique: 330
Unique (%): 99.4%
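
"Distinct" and "Unique" measure different things: distinct counts the different values, while unique counts the values that occur exactly once. The sketch below uses hypothetical stand-in names (not the real data) arranged so that one name occurs twice, which reproduces the counts reported for this variable.

```python
from collections import Counter

# Hypothetical stand-in data: 330 names that occur once plus one name
# that occurs twice, giving 332 rows in total.
values = [f"name_{i}" for i in range(330)] + ["repeated", "repeated"]

counts = Counter(values)
n_rows = len(values)

distinct = len(counts)                               # different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

distinct_pct = round(100 * distinct / n_rows, 1)
unique_pct = round(100 * unique / n_rows, 1)

print(distinct, distinct_pct)
print(unique, unique_pct)
```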

Sample

1st row: 하주골
2nd row: 디아펠리즈
3rd row: 돈가스클럽
4th row: 푸주옥
5th row: 산토리니
Value | Count | Frequency (%)
후평점 | 3 | 0.8%
그린나래(주 | 2 | 0.5%
주)지엘파가니카클럽하우스 | 2 | 0.5%
주)아워홈 | 2 | 0.5%
원조쌈밥집 | 2 | 0.5%
통나무집닭갈비 | 2 | 0.5%
클럽하우스대식당 | 1 | 0.3%
소양강농원 | 1 | 0.3%
뚜레 | 1 | 0.3%
춘천명종닭갈비 | 1 | 0.3%
Other values (352) | 352 | 95.4%

Most occurring characters

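Value | Count | Frequency (%)
(not preserved) | 82 | 3.3%
(not preserved) | 77 | 3.1%
(not preserved) | 67 | 2.7%
(not preserved) | 61 | 2.5%
(not preserved) | 58 | 2.4%
(not preserved) | 48 | 2.0%
(not preserved) | 47 | 1.9%
(not preserved) | 45 | 1.8%
( | 40 | 1.6%
(not preserved) | 40 | 1.6%
Other values (386) | 1886 | 76.9%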

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2287 | 93.3%
Open Punctuation | 40 | 1.6%
Close Punctuation | 40 | 1.6%
Space Separator | 37 | 1.5%
Decimal Number | 29 | 1.2%
Uppercase Letter | 15 | 0.6%
Connector Punctuation | 3 | 0.1%

Most frequent character per category

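Other Letter
Value | Count | Frequency (%)
(not preserved) | 82 | 3.6%
(not preserved) | 77 | 3.4%
(not preserved) | 67 | 2.9%
(not preserved) | 61 | 2.7%
(not preserved) | 58 | 2.5%
(not preserved) | 48 | 2.1%
(not preserved) | 47 | 2.1%
(not preserved) | 45 | 2.0%
(not preserved) | 40 | 1.7%
(not preserved) | 39 | 1.7%
Other values (365) | 1723 | 75.3%

Decimal Number
Value | Count | Frequency (%)
1 | 6 | 20.7%
2 | 5 | 17.2%
8 | 5 | 17.2%
9 | 3 | 10.3%
0 | 3 | 10.3%
4 | 2 | 6.9%
3 | 2 | 6.9%
6 | 2 | 6.9%
7 | 1 | 3.4%

Uppercase Letter
Value | Count | Frequency (%)
T | 4 | 26.7%
C | 4 | 26.7%
D | 2 | 13.3%
I | 1 | 6.7%
G | 1 | 6.7%
S | 1 | 6.7%
L | 1 | 6.7%
B | 1 | 6.7%

Open Punctuation
Value | Count | Frequency (%)
( | 40 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 40 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 37 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 3 | 100.0%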

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2287 | 93.3%
Common | 149 | 6.1%
Latin | 15 | 0.6%

Most frequent character per script

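Hangul
Value | Count | Frequency (%)
(not preserved) | 82 | 3.6%
(not preserved) | 77 | 3.4%
(not preserved) | 67 | 2.9%
(not preserved) | 61 | 2.7%
(not preserved) | 58 | 2.5%
(not preserved) | 48 | 2.1%
(not preserved) | 47 | 2.1%
(not preserved) | 45 | 2.0%
(not preserved) | 40 | 1.7%
(not preserved) | 39 | 1.7%
Other values (365) | 1723 | 75.3%

Common
Value | Count | Frequency (%)
( | 40 | 26.8%
) | 40 | 26.8%
(space) | 37 | 24.8%
1 | 6 | 4.0%
2 | 5 | 3.4%
8 | 5 | 3.4%
9 | 3 | 2.0%
0 | 3 | 2.0%
_ | 3 | 2.0%
4 | 2 | 1.3%
Other values (3) | 5 | 3.4%

Latin
Value | Count | Frequency (%)
T | 4 | 26.7%
C | 4 | 26.7%
D | 2 | 13.3%
I | 1 | 6.7%
G | 1 | 6.7%
S | 1 | 6.7%
L | 1 | 6.7%
B | 1 | 6.7%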

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2287 | 93.3%
ASCII | 164 | 6.7%

Most frequent character per block

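Hangul
Value | Count | Frequency (%)
(not preserved) | 82 | 3.6%
(not preserved) | 77 | 3.4%
(not preserved) | 67 | 2.9%
(not preserved) | 61 | 2.7%
(not preserved) | 58 | 2.5%
(not preserved) | 48 | 2.1%
(not preserved) | 47 | 2.1%
(not preserved) | 45 | 2.0%
(not preserved) | 40 | 1.7%
(not preserved) | 39 | 1.7%
Other values (365) | 1723 | 75.3%

ASCII
Value | Count | Frequency (%)
( | 40 | 24.4%
) | 40 | 24.4%
(space) | 37 | 22.6%
1 | 6 | 3.7%
2 | 5 | 3.0%
8 | 5 | 3.0%
T | 4 | 2.4%
C | 4 | 2.4%
9 | 3 | 1.8%
0 | 3 | 1.8%
Other values (11) | 17 | 10.4%
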
사업장도로명주소 (business road-name address)
Text

Distinct: 318
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 38
Median length: 34.5
Mean length: 23.391566
Min length: 1

Characters and Unicode

Total characters: 7766
Distinct characters: 230
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 306
Unique (%): 92.2%

Sample

1st row: 강원도 춘천시 동면 만천로 163
2nd row: 강원도 춘천시 춘천로 71 (효자동_ 디아펠리즈)
3rd row: 강원도 춘천시 춘천로 429 (후평동)
4th row: 강원도 춘천시 우묵들길 37 (퇴계동)
5th row: 강원도 춘천시 동면 순환대로 1154-97
Value | Count | Frequency (%)
강원도 | 331 | 18.5%
춘천시 | 331 | 18.5%
퇴계동 | 46 | 2.6%
동면 | 45 | 2.5%
1층 | 38 | 2.1%
후평동 | 27 | 1.5%
효자동 | 21 | 1.2%
석사동 | 21 | 1.2%
동내면 | 19 | 1.1%
신북읍 | 18 | 1.0%
Other values (445) | 888 | 49.7%

Most occurring characters

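Value | Count | Frequency (%)
(space) | 1459 | 18.8%
(not preserved) | 405 | 5.2%
(not preserved) | 392 | 5.0%
(not preserved) | 355 | 4.6%
(not preserved) | 353 | 4.5%
(not preserved) | 336 | 4.3%
(not preserved) | 335 | 4.3%
(not preserved) | 287 | 3.7%
1 | 273 | 3.5%
(not preserved) | 234 | 3.0%
Other values (220) | 3337 | 43.0%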

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4542 | 58.5%
Space Separator | 1459 | 18.8%
Decimal Number | 1165 | 15.0%
Open Punctuation | 206 | 2.7%
Close Punctuation | 206 | 2.7%
Connector Punctuation | 116 | 1.5%
Dash Punctuation | 58 | 0.7%
Uppercase Letter | 7 | 0.1%
Math Symbol | 5 | 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 405 | 8.9%
(not preserved) | 392 | 8.6%
(not preserved) | 355 | 7.8%
(not preserved) | 353 | 7.8%
(not preserved) | 336 | 7.4%
(not preserved) | 335 | 7.4%
(not preserved) | 287 | 6.3%
(not preserved) | 234 | 5.2%
(not preserved) | 154 | 3.4%
(not preserved) | 110 | 2.4%
Other values (198) | 1581 | 34.8%

Decimal Number
Value | Count | Frequency (%)
1 | 273 | 23.4%
2 | 188 | 16.1%
3 | 131 | 11.2%
6 | 103 | 8.8%
4 | 93 | 8.0%
5 | 84 | 7.2%
8 | 82 | 7.0%
0 | 73 | 6.3%
7 | 72 | 6.2%
9 | 66 | 5.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 4 | 57.1%
B | 1 | 14.3%
T | 1 | 14.3%
I | 1 | 14.3%

Lowercase Letter
Value | Count | Frequency (%)
m | 1 | 50.0%
s | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1459 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 206 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 206 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 116 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 58 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4542 | 58.5%
Common | 3215 | 41.4%
Latin | 9 | 0.1%

Most frequent character per script

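Hangul
Value | Count | Frequency (%)
(not preserved) | 405 | 8.9%
(not preserved) | 392 | 8.6%
(not preserved) | 355 | 7.8%
(not preserved) | 353 | 7.8%
(not preserved) | 336 | 7.4%
(not preserved) | 335 | 7.4%
(not preserved) | 287 | 6.3%
(not preserved) | 234 | 5.2%
(not preserved) | 154 | 3.4%
(not preserved) | 110 | 2.4%
Other values (198) | 1581 | 34.8%

Common
Value | Count | Frequency (%)
(space) | 1459 | 45.4%
1 | 273 | 8.5%
( | 206 | 6.4%
) | 206 | 6.4%
2 | 188 | 5.8%
3 | 131 | 4.1%
_ | 116 | 3.6%
6 | 103 | 3.2%
4 | 93 | 2.9%
5 | 84 | 2.6%
Other values (6) | 356 | 11.1%

Latin
Value | Count | Frequency (%)
C | 4 | 44.4%
B | 1 | 11.1%
T | 1 | 11.1%
I | 1 | 11.1%
m | 1 | 11.1%
s | 1 | 11.1%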

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4542 | 58.5%
ASCII | 3224 | 41.5%

Most frequent character per block

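ASCII
Value | Count | Frequency (%)
(space) | 1459 | 45.3%
1 | 273 | 8.5%
( | 206 | 6.4%
) | 206 | 6.4%
2 | 188 | 5.8%
3 | 131 | 4.1%
_ | 116 | 3.6%
6 | 103 | 3.2%
4 | 93 | 2.9%
5 | 84 | 2.6%
Other values (12) | 365 | 11.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 405 | 8.9%
(not preserved) | 392 | 8.6%
(not preserved) | 355 | 7.8%
(not preserved) | 353 | 7.8%
(not preserved) | 336 | 7.4%
(not preserved) | 335 | 7.4%
(not preserved) | 287 | 6.3%
(not preserved) | 234 | 5.2%
(not preserved) | 154 | 3.4%
(not preserved) | 110 | 2.4%
Other values (198) | 1581 | 34.8%
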
사업장지번주소 (business lot-number address)
Text

Distinct: 298
Distinct (%): 89.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 34
Median length: 30
Mean length: 18.75
Min length: 1

Characters and Unicode

Total characters: 6225
Distinct characters: 159
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 280
Unique (%): 84.3%

Sample

1st row: 강원도 춘천시 동면 만천리 807-4
2nd row: 강원도 춘천시 효자동 681-20
3rd row: 강원도 춘천시 후평동 179-2
4th row: 강원도 춘천시 퇴계동 1136-8
5th row: 강원도 춘천시 동면 장학리 144-16
Value | Count | Frequency (%)
강원도 | 319 | 22.2%
춘천시 | 319 | 22.2%
퇴계동 | 45 | 3.1%
동면 | 42 | 2.9%
후평동 | 25 | 1.7%
효자동 | 21 | 1.5%
만천리 | 21 | 1.5%
석사동 | 19 | 1.3%
동내면 | 18 | 1.3%
신북읍 | 18 | 1.3%
Other values (372) | 590 | 41.1%

Most occurring characters

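Value | Count | Frequency (%)
(space) | 1451 | 23.3%
(not preserved) | 375 | 6.0%
(not preserved) | 330 | 5.3%
(not preserved) | 324 | 5.2%
(not preserved) | 323 | 5.2%
(not preserved) | 322 | 5.2%
(not preserved) | 320 | 5.1%
1 | 304 | 4.9%
(not preserved) | 274 | 4.4%
- | 225 | 3.6%
Other values (149) | 1977 | 31.8%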

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3321 | 53.3%
Space Separator | 1451 | 23.3%
Decimal Number | 1216 | 19.5%
Dash Punctuation | 225 | 3.6%
Uppercase Letter | 4 | 0.1%
Close Punctuation | 3 | < 0.1%
Open Punctuation | 3 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

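Other Letter
Value | Count | Frequency (%)
(not preserved) | 375 | 11.3%
(not preserved) | 330 | 9.9%
(not preserved) | 324 | 9.8%
(not preserved) | 323 | 9.7%
(not preserved) | 322 | 9.7%
(not preserved) | 320 | 9.6%
(not preserved) | 274 | 8.3%
(not preserved) | 123 | 3.7%
(not preserved) | 104 | 3.1%
(not preserved) | 46 | 1.4%
Other values (131) | 780 | 23.5%

Decimal Number
Value | Count | Frequency (%)
1 | 304 | 25.0%
3 | 137 | 11.3%
2 | 122 | 10.0%
4 | 107 | 8.8%
8 | 96 | 7.9%
6 | 95 | 7.8%
9 | 93 | 7.6%
7 | 91 | 7.5%
5 | 91 | 7.5%
0 | 80 | 6.6%

Uppercase Letter
Value | Count | Frequency (%)
C | 2 | 50.0%
I | 1 | 25.0%
T | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1451 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 225 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%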

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3321 | 53.3%
Common | 2900 | 46.6%
Latin | 4 | 0.1%

Most frequent character per script

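Hangul
Value | Count | Frequency (%)
(not preserved) | 375 | 11.3%
(not preserved) | 330 | 9.9%
(not preserved) | 324 | 9.8%
(not preserved) | 323 | 9.7%
(not preserved) | 322 | 9.7%
(not preserved) | 320 | 9.6%
(not preserved) | 274 | 8.3%
(not preserved) | 123 | 3.7%
(not preserved) | 104 | 3.1%
(not preserved) | 46 | 1.4%
Other values (131) | 780 | 23.5%

Common
Value | Count | Frequency (%)
(space) | 1451 | 50.0%
1 | 304 | 10.5%
- | 225 | 7.8%
3 | 137 | 4.7%
2 | 122 | 4.2%
4 | 107 | 3.7%
8 | 96 | 3.3%
6 | 95 | 3.3%
9 | 93 | 3.2%
7 | 91 | 3.1%
Other values (5) | 179 | 6.2%

Latin
Value | Count | Frequency (%)
C | 2 | 50.0%
I | 1 | 25.0%
T | 1 | 25.0%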

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3321 | 53.3%
ASCII | 2904 | 46.7%

Most frequent character per block

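ASCII
Value | Count | Frequency (%)
(space) | 1451 | 50.0%
1 | 304 | 10.5%
- | 225 | 7.7%
3 | 137 | 4.7%
2 | 122 | 4.2%
4 | 107 | 3.7%
8 | 96 | 3.3%
6 | 95 | 3.3%
9 | 93 | 3.2%
7 | 91 | 3.1%
Other values (8) | 183 | 6.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 375 | 11.3%
(not preserved) | 330 | 9.9%
(not preserved) | 324 | 9.8%
(not preserved) | 323 | 9.7%
(not preserved) | 322 | 9.7%
(not preserved) | 320 | 9.6%
(not preserved) | 274 | 8.3%
(not preserved) | 123 | 3.7%
(not preserved) | 104 | 3.1%
(not preserved) | 46 | 1.4%
Other values (131) | 780 | 23.5%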

사업장구분 (establishment type)
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

일반음식점: 208
집단급식소: 115
휴게음식점: 4
관광숙박시설: 2
대규모점포: 2

Length

Max length: 6
Median length: 5
Mean length: 4.996988
Min length: 2

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 일반음식점
5th row: 일반음식점

Common Values

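Value | Count | Frequency (%)
일반음식점 (general restaurant) | 208 | 62.7%
집단급식소 (group meal service facility) | 115 | 34.6%
휴게음식점 (snack bar) | 4 | 1.2%
관광숙박시설 (tourist accommodation) | 2 | 0.6%
대규모점포 (large-scale store) | 2 | 0.6%
기타 (other) | 1 | 0.3%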
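
The 55.8% imbalance figure in the alert for this variable can be reproduced from the counts above. The sketch assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for six classes. That definition is an assumption made here for illustration, although it does reproduce the reported value.

```python
import math

# Class counts copied from the Common Values table.
counts = {
    "일반음식점": 208,
    "집단급식소": 115,
    "휴게음식점": 4,
    "관광숙박시설": 2,
    "대규모점포": 2,
    "기타": 1,
}

total = sum(counts.values())

# Shannon entropy of the class distribution, in bits.
entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())

# Assumed score: 0 for a perfectly balanced variable, 1 for a single class.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{imbalance:.1%}")
```

Two classes account for more than 97% of the rows, so the entropy is well below the maximum of log2(6) bits, which pushes the score above one half.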

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the data is the same as in the Common Values table]

규모(m2) (floor area in m²)
Real number (ℝ)

Distinct: 274
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 493.60127
Minimum: 60
Maximum: 4708
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 60
5-th percentile: 127.1
Q1: 238.6875
Median: 300
Q3: 560.9825
95-th percentile: 1365.85
Maximum: 4708
Range: 4648
Interquartile range (IQR): 322.295
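
The range and the interquartile range are derived from the other quantiles in this list. A short check using only the reported values:

```python
# Quantiles copied from the list above (floor area in m2).
minimum, maximum = 60.0, 4708.0
q1, q3 = 238.6875, 560.9825

# Range: distance between the smallest and largest value.
value_range = maximum - minimum

# Interquartile range: spread of the middle 50% of the values.
iqr = q3 - q1

print(value_range)
print(round(iqr, 3))
```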

Descriptive statistics

Standard deviation: 501.94192
Coefficient of variation (CV): 1.0168976
Kurtosis: 20.154359
Mean: 493.60127
Median Absolute Deviation (MAD): 91.365
Skewness: 3.6912636
Sum: 163875.62
Variance: 251945.69
Monotonicity: Not monotonic
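
Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks them against the reported (rounded) values; the observation count of 332 comes from the dataset statistics.

```python
# Values copied from the report.
n_observations = 332
total = 163875.62
std = 501.94192

# Mean = sum / n
mean = total / n_observations

# Coefficient of variation = standard deviation / mean
cv = std / mean

# Variance = standard deviation squared
variance = std ** 2

print(round(mean, 5))
print(round(cv, 7))
print(round(variance, 2))
```

A CV slightly above 1 means the standard deviation is about as large as the mean, which fits the strong right skew (skewness 3.69) reported for this variable.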
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
100.0 | 7 | 2.1%
1000.0 | 6 | 1.8%
250.0 | 6 | 1.8%
300.0 | 6 | 1.8%
400.0 | 5 | 1.5%
150.0 | 5 | 1.5%
1320.0 | 4 | 1.2%
350.0 | 4 | 1.2%
265.0 | 4 | 1.2%
510.0 | 3 | 0.9%
Other values (264) | 282 | 84.9%

Minimum 10 values
Value | Count | Frequency (%)
60.0 | 1 | 0.3%
63.0 | 1 | 0.3%
78.0 | 1 | 0.3%
100.0 | 7 | 2.1%
110.0 | 3 | 0.9%
120.0 | 3 | 0.9%
126.0 | 1 | 0.3%
128.0 | 1 | 0.3%
130.0 | 1 | 0.3%
140.0 | 1 | 0.3%

Maximum 10 values
Value | Count | Frequency (%)
4708.0 | 1 | 0.3%
3128.76 | 1 | 0.3%
3000.0 | 1 | 0.3%
2600.0 | 1 | 0.3%
2500.0 | 1 | 0.3%
2431.42 | 1 | 0.3%
1900.0 | 1 | 0.3%
1850.0 | 1 | 0.3%
1640.0 | 1 | 0.3%
1600.0 | 3 | 0.9%

데이터기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
Minimum: 2021-10-28 00:00:00
Maximum: 2021-10-28 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 사업장구분 | 규모(m2)
사업장구분 | 1.000 | 0.594
규모(m2) | 0.594 | 1.000

 | 규모(m2) | 사업장구분
규모(m2) | 1.000 | 0.405
사업장구분 | 0.405 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows
(index) | 상호 | 사업장도로명주소 | 사업장지번주소 | 사업장구분 | 규모(m2) | 데이터기준일자
0 | 하주골 | 강원도 춘천시 동면 만천로 163 | 강원도 춘천시 동면 만천리 807-4 | 일반음식점 | 300.75 | 2021-10-28
1 | 디아펠리즈 | 강원도 춘천시 춘천로 71 (효자동_ 디아펠리즈) | 강원도 춘천시 효자동 681-20 | 일반음식점 | 2431.42 | 2021-10-28
2 | 돈가스클럽 | 강원도 춘천시 춘천로 429 (후평동) | 강원도 춘천시 후평동 179-2 | 일반음식점 | 295.25 | 2021-10-28
3 | 푸주옥 | 강원도 춘천시 우묵들길 37 (퇴계동) | 강원도 춘천시 퇴계동 1136-8 | 일반음식점 | 311.66 | 2021-10-28
4 | 산토리니 | 강원도 춘천시 동면 순환대로 1154-97 | 강원도 춘천시 동면 장학리 144-16 | 일반음식점 | 641.52 | 2021-10-28
5 | 봉운장 | 강원도 춘천시 소양고개길 26 (소양로3가) | 강원도 춘천시 소양로3가 4 | 일반음식점 | 435.12 | 2021-10-28
6 | 회양회집 | 강원도 춘천시 서면 삿갓봉길 21 | 강원도 춘천시 서면 오월리 96-1 | 일반음식점 | 260.19 | 2021-10-28
7 | 춘천농민한우 | 강원도 춘천시 충열로 90 (우두동) | 강원도 춘천시 우두동 413-1 | 일반음식점 | 330.96 | 2021-10-28
8 | 쟈스민 | 강원도 춘천시 우묵길74번길 14 (퇴계동) | 강원도 춘천시 퇴계동 1159-4 | 일반음식점 | 480.4 | 2021-10-28
9 | 에스엠이벤트회관 | 강원도 춘천시 동면 방죽길 28-26_ 1층 | 강원도 춘천시 동면 장학리 685-7 | 일반음식점 | 257.37 | 2021-10-28

Last 10 rows
(index) | 상호 | 사업장도로명주소 | 사업장지번주소 | 사업장구분 | 규모(m2) | 데이터기준일자
322 | 유림닭갈비 | 강원도 춘천시 안마산로 34 (온의동) | 강원도 춘천시 온의동 329-3 | 일반음식점 | 473.04 | 2021-10-28
323 | 금산초등학교 | 강원도 춘천시 서면 금산2길 21 | 강원도 춘천시 서면 금산리 472-1 | 집단급식소 | 110.0 | 2021-10-28
324 | 청궁 | 강원도 춘천시 동내면 거두택지길 39 | 강원도 춘천시 동내면 거두리 1090-3 | 일반음식점 | 231.8 | 2021-10-28
325 | 치엔롱 | 강원도 춘천시 중앙로68번길 12 (낙원동_ 춘천관광호텔 2층) | 강원도 춘천시 낙원동 30-1 | 일반음식점 | 280.1 | 2021-10-28
326 | (주)한국고용정보 | 강원도 춘천시 영서로 2491 (근화동) | 강원도 춘천시 근화동 808 | 집단급식소 | 250.0 | 2021-10-28
327 | 춘천시노인전문병원 | 강원도 춘천시 동면 세실로 252 | 강원도 춘천시 동면 만천리 811-5 | 집단급식소 | 550.0 | 2021-10-28
328 | 봄내병원 | 강원도 춘천시 공지로 270 (효자동) | 강원도 춘천시 효자동 312-1 | 집단급식소 | 520.0 | 2021-10-28
329 | 강원도재활병원 | 강원도 춘천시 충열로142번길 24-16 (우두동) | 강원도 춘천시 우두동 291-2 | 집단급식소 | 400.0 | 2021-10-28
330 | 만천초등학교 | 강원도 춘천시 동면 만천로 165-7 | 강원도 춘천시 동면 만천리 807-5 | 집단급식소 | 1050.0 | 2021-10-28
331 | 델모니코스 | 강원도 춘천시 동면 순환대로 1154-106 (1~2)층 | 강원도 춘천시 동면 장학리 139-55 (1~2층) | 일반음식점 | 366.99 | 2021-10-28