Overview

Dataset statistics

Number of variables: 5
Number of observations: 320
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.3 KiB
Average record size in memory: 42.4 B

Variable types

Numeric: 2
Text: 2
Categorical: 1

Dataset

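Description: Data on businesses in Uijeongbu-si, Gyeonggi-do that discharge large volumes of food waste. Fields include the serial number (연번), business name (상호), business road-name address (사업장 주소(도로명)), business type (사업장 구분), and estimated annual discharge in kilograms (연배출예상량(킬로그램)).
Author: 경기도 의정부시 (Uijeongbu-si, Gyeonggi-do)
URL: https://www.data.go.kr/data/15034438/fileData.do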

Alerts

사업장 구분 is highly imbalanced (50.6%) [Imbalance]
연번 has unique values [Unique]
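
The report does not say how the 50.6% imbalance figure is derived. The sketch below assumes it is one minus the Shannon entropy of the class shares, normalised by the maximum entropy for five classes; the helper name `imbalance_score` is hypothetical. Fed with the class counts listed for 사업장 구분 (181, 132, 3, 2, 2), it reproduces the reported value.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes.

    Assumption: this mirrors the score behind the report's alert;
    the report itself does not document the formula.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts for 사업장 구분 as listed in the report.
counts = [181, 132, 3, 2, 2]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score near 1 would mean a single dominant class, so 50.6% reflects two large classes and three near-empty ones.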

Reproduction

Analysis started: 2023-12-12 21:11:40.915959
Analysis finished: 2023-12-12 21:11:41.808251
Duration: 0.89 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 320
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 160.5
Minimum: 1
Maximum: 320
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 16.95
Q1: 80.75
Median: 160.5
Q3: 240.25
95-th percentile: 304.05
Maximum: 320
Range: 319
Interquartile range (IQR): 159.5

Descriptive statistics

Standard deviation: 92.520268
Coefficient of variation (CV): 0.57645027
Kurtosis: -1.2
Mean: 160.5
Median Absolute Deviation (MAD): 80
Skewness: 0
Sum: 51360
Variance: 8560
Monotonicity: Strictly increasing
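
The statistics for 연번 are consistent with a plain running index. The following sketch assumes the column holds exactly the integers 1 to 320 (the report only states the minimum, maximum, distinct count and monotonicity) and recomputes several reported values with Python's standard library. The quantiles use linear interpolation between order statistics (`method="inclusive"`), which is an assumption about how the report computes them.

```python
import statistics

# Assumption: 연번 is exactly 1, 2, ..., 320.
values = list(range(1, 321))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

# Quartiles with linear interpolation between order statistics.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
# 5th and 95th percentiles are the first and last of the 20-quantiles.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
print(p5, q1, q2, q3, p95, q3 - q1)
```

Under that assumption the sum, mean, variance, standard deviation, CV, MAD and quantiles all agree with the figures listed above.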
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.3%
162 | 1 | 0.3%
220 | 1 | 0.3%
219 | 1 | 0.3%
218 | 1 | 0.3%
217 | 1 | 0.3%
216 | 1 | 0.3%
215 | 1 | 0.3%
214 | 1 | 0.3%
213 | 1 | 0.3%
Other values (310) | 310 | 96.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values
Value | Count | Frequency (%)
320 | 1 | 0.3%
319 | 1 | 0.3%
318 | 1 | 0.3%
317 | 1 | 0.3%
316 | 1 | 0.3%
315 | 1 | 0.3%
314 | 1 | 0.3%
313 | 1 | 0.3%
312 | 1 | 0.3%
311 | 1 | 0.3%

상호
Text

Distinct: 311
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 27
Median length: 20
Mean length: 7.659375
Min length: 2

Characters and Unicode

Total characters: 2451
Distinct characters: 391
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 302
Unique (%): 94.4%

Sample

1st row: 솔뫼초등학교
2nd row: 3.3보리밥뷔페
3rd row: 동암초등학교
4th row: 의정부효자초등학교
5th row: 항도일식

Value | Count | Frequency (%)
의정부점 | 10 | 2.4%
주식회사 | 6 | 1.4%
구내식당 | 5 | 1.2%
민락점 | 4 | 1.0%
의료법인 | 3 | 0.7%
의정부 | 3 | 0.7%
주)아워홈 | 3 | 0.7%
마스터병원 | 2 | 0.5%
명륜진사갈비 | 2 | 0.5%
구끼구끼 | 2 | 0.5%
Other values (362) | 374 | 90.3%

Most occurring characters

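Value | Count | Frequency (%)
(space) | 94 | 3.8%
[not recovered] | 79 | 3.2%
[not recovered] | 78 | 3.2%
[not recovered] | 71 | 2.9%
[not recovered] | 70 | 2.9%
[not recovered] | 65 | 2.7%
[not recovered] | 55 | 2.2%
[not recovered] | 53 | 2.2%
[not recovered] | 46 | 1.9%
) | 46 | 1.9%
Other values (381) | 1794 | 73.2%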

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2239 | 91.4%
Space Separator | 94 | 3.8%
Close Punctuation | 46 | 1.9%
Open Punctuation | 46 | 1.9%
Decimal Number | 12 | 0.5%
Uppercase Letter | 9 | 0.4%
Other Punctuation | 3 | 0.1%
Connector Punctuation | 2 | 0.1%

Most frequent character per category

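Other Letter
Value | Count | Frequency (%)
[not recovered] | 79 | 3.5%
[not recovered] | 78 | 3.5%
[not recovered] | 71 | 3.2%
[not recovered] | 70 | 3.1%
[not recovered] | 65 | 2.9%
[not recovered] | 55 | 2.5%
[not recovered] | 53 | 2.4%
[not recovered] | 46 | 2.1%
[not recovered] | 39 | 1.7%
[not recovered] | 37 | 1.7%
Other values (363) | 1646 | 73.5%

Decimal Number
Value | Count | Frequency (%)
2 | 4 | 33.3%
0 | 3 | 25.0%
3 | 2 | 16.7%
9 | 1 | 8.3%
5 | 1 | 8.3%
1 | 1 | 8.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 3 | 33.3%
T | 2 | 22.2%
Q | 1 | 11.1%
N | 1 | 11.1%
I | 1 | 11.1%
K | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 66.7%
. | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 94 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 46 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 46 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%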

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2239 | 91.4%
Common | 203 | 8.3%
Latin | 9 | 0.4%

Most frequent character per script

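Hangul
Value | Count | Frequency (%)
[not recovered] | 79 | 3.5%
[not recovered] | 78 | 3.5%
[not recovered] | 71 | 3.2%
[not recovered] | 70 | 3.1%
[not recovered] | 65 | 2.9%
[not recovered] | 55 | 2.5%
[not recovered] | 53 | 2.4%
[not recovered] | 46 | 2.1%
[not recovered] | 39 | 1.7%
[not recovered] | 37 | 1.7%
Other values (363) | 1646 | 73.5%

Common
Value | Count | Frequency (%)
(space) | 94 | 46.3%
) | 46 | 22.7%
( | 46 | 22.7%
2 | 4 | 2.0%
0 | 3 | 1.5%
3 | 2 | 1.0%
_ | 2 | 1.0%
& | 2 | 1.0%
9 | 1 | 0.5%
5 | 1 | 0.5%
Other values (2) | 2 | 1.0%

Latin
Value | Count | Frequency (%)
B | 3 | 33.3%
T | 2 | 22.2%
Q | 1 | 11.1%
N | 1 | 11.1%
I | 1 | 11.1%
K | 1 | 11.1%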

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2239 | 91.4%
ASCII | 212 | 8.6%

Most frequent character per block

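ASCII
Value | Count | Frequency (%)
(space) | 94 | 44.3%
) | 46 | 21.7%
( | 46 | 21.7%
2 | 4 | 1.9%
0 | 3 | 1.4%
B | 3 | 1.4%
3 | 2 | 0.9%
T | 2 | 0.9%
_ | 2 | 0.9%
& | 2 | 0.9%
Other values (8) | 8 | 3.8%

Hangul
Value | Count | Frequency (%)
[not recovered] | 79 | 3.5%
[not recovered] | 78 | 3.5%
[not recovered] | 71 | 3.2%
[not recovered] | 70 | 3.1%
[not recovered] | 65 | 2.9%
[not recovered] | 55 | 2.5%
[not recovered] | 53 | 2.4%
[not recovered] | 46 | 2.1%
[not recovered] | 39 | 1.7%
[not recovered] | 37 | 1.7%
Other values (363) | 1646 | 73.5%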

사업장 주소(도로명)
Text

Distinct: 299
Distinct (%): 93.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 53
Median length: 40
Mean length: 27.959375
Min length: 20

Characters and Unicode

Total characters: 8947
Distinct characters: 233
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 282
Unique (%): 88.1%

Sample

1st row: 경기도 의정부시 시민로 442 (용현동_ 솔뫼초등학교)
2nd row: 경기도 의정부시 태평로73번길 20 (의정부동_ 의정부제일시장)
3rd row: 경기도 의정부시 장곡로 206 (장암동_ 동암초등학교)
4th row: 경기도 의정부시 부용로204번길 16 (신곡동_ 효자초등학교)
5th row: 경기도 의정부시 의정로46번길 33 (의정부동)

Value | Count | Frequency (%)
경기도 | 318 | 17.3%
의정부시 | 318 | 17.3%
의정부동 | 79 | 4.3%
신곡동 | 51 | 2.8%
민락동 | 35 | 1.9%
장암동 | 27 | 1.5%
평화로 | 27 | 1.5%
호원동 | 26 | 1.4%
가능동 | 23 | 1.3%
1층 | 21 | 1.1%
Other values (487) | 914 | 49.7%

Most occurring characters

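Value | Count | Frequency (%)
(space) | 1519 | 17.0%
[not recovered] | 452 | 5.1%
[not recovered] | 443 | 5.0%
[not recovered] | 429 | 4.8%
[not recovered] | 372 | 4.2%
[not recovered] | 352 | 3.9%
[not recovered] | 331 | 3.7%
[not recovered] | 330 | 3.7%
[not recovered] | 322 | 3.6%
( | 320 | 3.6%
Other values (223) | 4077 | 45.6%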

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5334 | 59.6%
Space Separator | 1519 | 17.0%
Decimal Number | 1219 | 13.6%
Open Punctuation | 320 | 3.6%
Close Punctuation | 320 | 3.6%
Connector Punctuation | 183 | 2.0%
Dash Punctuation | 39 | 0.4%
Uppercase Letter | 7 | 0.1%
Math Symbol | 6 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not recovered] | 452 | 8.5%
[not recovered] | 443 | 8.3%
[not recovered] | 429 | 8.0%
[not recovered] | 372 | 7.0%
[not recovered] | 352 | 6.6%
[not recovered] | 331 | 6.2%
[not recovered] | 330 | 6.2%
[not recovered] | 322 | 6.0%
[not recovered] | 319 | 6.0%
[not recovered] | 94 | 1.8%
Other values (205) | 1890 | 35.4%

Decimal Number
Value | Count | Frequency (%)
1 | 232 | 19.0%
2 | 220 | 18.0%
5 | 135 | 11.1%
4 | 125 | 10.3%
3 | 103 | 8.4%
6 | 95 | 7.8%
9 | 87 | 7.1%
0 | 84 | 6.9%
7 | 69 | 5.7%
8 | 69 | 5.7%

Uppercase Letter
Value | Count | Frequency (%)
A | 5 | 71.4%
B | 2 | 28.6%

Space Separator
Value | Count | Frequency (%)
(space) | 1519 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 320 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 320 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 183 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 39 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 6 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5334 | 59.6%
Common | 3606 | 40.3%
Latin | 7 | 0.1%

Most frequent character per script

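Hangul
Value | Count | Frequency (%)
[not recovered] | 452 | 8.5%
[not recovered] | 443 | 8.3%
[not recovered] | 429 | 8.0%
[not recovered] | 372 | 7.0%
[not recovered] | 352 | 6.6%
[not recovered] | 331 | 6.2%
[not recovered] | 330 | 6.2%
[not recovered] | 322 | 6.0%
[not recovered] | 319 | 6.0%
[not recovered] | 94 | 1.8%
Other values (205) | 1890 | 35.4%

Common
Value | Count | Frequency (%)
(space) | 1519 | 42.1%
( | 320 | 8.9%
) | 320 | 8.9%
1 | 232 | 6.4%
2 | 220 | 6.1%
_ | 183 | 5.1%
5 | 135 | 3.7%
4 | 125 | 3.5%
3 | 103 | 2.9%
6 | 95 | 2.6%
Other values (6) | 354 | 9.8%

Latin
Value | Count | Frequency (%)
A | 5 | 71.4%
B | 2 | 28.6%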

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5334 | 59.6%
ASCII | 3613 | 40.4%

Most frequent character per block

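ASCII
Value | Count | Frequency (%)
(space) | 1519 | 42.0%
( | 320 | 8.9%
) | 320 | 8.9%
1 | 232 | 6.4%
2 | 220 | 6.1%
_ | 183 | 5.1%
5 | 135 | 3.7%
4 | 125 | 3.5%
3 | 103 | 2.9%
6 | 95 | 2.6%
Other values (8) | 361 | 10.0%

Hangul
Value | Count | Frequency (%)
[not recovered] | 452 | 8.5%
[not recovered] | 443 | 8.3%
[not recovered] | 429 | 8.0%
[not recovered] | 372 | 7.0%
[not recovered] | 352 | 6.6%
[not recovered] | 331 | 6.2%
[not recovered] | 330 | 6.2%
[not recovered] | 322 | 6.0%
[not recovered] | 319 | 6.0%
[not recovered] | 94 | 1.8%
Other values (205) | 1890 | 35.4%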

사업장 구분
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

일반음식점: 181
집단급식소: 132
휴게음식점: 3
대규모점포: 2
관광숙박시설: 2

Length

Max length: 6
Median length: 5
Mean length: 5.00625
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 집단급식소
2nd row: 일반음식점
3rd row: 집단급식소
4th row: 집단급식소
5th row: 일반음식점

Common Values

Value | Count | Frequency (%)
일반음식점 | 181 | 56.6%
집단급식소 | 132 | 41.2%
휴게음식점 | 3 | 0.9%
대규모점포 | 2 | 0.6%
관광숙박시설 | 2 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반음식점 | 181 | 56.6%
집단급식소 | 132 | 41.2%
휴게음식점 | 3 | 0.9%
대규모점포 | 2 | 0.6%
관광숙박시설 | 2 | 0.6%

연배출예상량(킬로그램)
Real number (ℝ)

Distinct: 159
Distinct (%): 49.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27993.588
Minimum: 180
Maximum: 600000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.9 KiB

Quantile statistics

Minimum: 180
5-th percentile: 3600
Q1: 10560
Median: 18000
Q3: 30300
95-th percentile: 72240
Maximum: 600000
Range: 599820
Interquartile range (IQR): 19740

Descriptive statistics

Standard deviation: 47974.815
Coefficient of variation (CV): 1.7137787
Kurtosis: 82.295692
Mean: 27993.588
Median Absolute Deviation (MAD): 9920
Skewness: 8.0430914
Sum: 8957948
Variance: 2.3015829 × 10^9
Monotonicity: Not monotonic
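
Several of the figures for 연배출예상량(킬로그램) follow from one another, so they can be cross-checked without the raw data. The sketch below uses only numbers printed in this report (sum, observation count, standard deviation, quartiles, minimum and maximum); the variable names are illustrative.

```python
# Figures copied from the report for 연배출예상량(킬로그램).
n = 320                # number of observations
total = 8957948        # Sum
std = 47974.815        # Standard deviation (as printed, rounded)
q1, q3 = 10560, 30300  # Q1 and Q3
minimum, maximum = 180, 600000

mean = total / n                 # Mean = Sum / n
cv = std / mean                  # CV = standard deviation / mean
variance = std ** 2              # Variance = standard deviation squared
iqr = q3 - q1                    # IQR = Q3 - Q1
value_range = maximum - minimum  # Range = Maximum - Minimum

print(f"mean={mean:.3f} cv={cv:.5f} variance={variance:.4e}")
print(f"iqr={iqr} range={value_range}")
```

The mean (about 27994 kg) sits well above the median (18000 kg), and the CV exceeds 1, which matches the strong right skew reported (skewness 8.04) and the two extreme values of 448800 and 600000.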
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
18000 | 18 | 5.6%
14400 | 16 | 5.0%
7200 | 14 | 4.4%
28800 | 13 | 4.1%
10800 | 11 | 3.4%
12000 | 10 | 3.1%
36000 | 10 | 3.1%
21600 | 9 | 2.8%
3600 | 7 | 2.2%
32400 | 5 | 1.6%
Other values (149) | 207 | 64.7%

Minimum 10 values
Value | Count | Frequency (%)
180 | 1 | 0.3%
420 | 1 | 0.3%
780 | 1 | 0.3%
1548 | 1 | 0.3%
1800 | 1 | 0.3%
2760 | 1 | 0.3%
2880 | 1 | 0.3%
3000 | 1 | 0.3%
3240 | 2 | 0.6%
3600 | 7 | 2.2%

Maximum 10 values
Value | Count | Frequency (%)
600000 | 1 | 0.3%
448800 | 1 | 0.3%
219000 | 1 | 0.3%
182160 | 1 | 0.3%
174240 | 1 | 0.3%
157200 | 1 | 0.3%
144000 | 1 | 0.3%
125280 | 1 | 0.3%
110880 | 1 | 0.3%
108000 | 1 | 0.3%

Interactions

[Four interaction plots; images not recovered]

Correlations

Variable | 연번 | 사업장 구분 | 연배출예상량(킬로그램)
연번 | 1.000 | 0.502 | 0.000
사업장 구분 | 0.502 | 1.000 | 0.485
연배출예상량(킬로그램) | 0.000 | 0.485 | 1.000

Variable | 연번 | 연배출예상량(킬로그램) | 사업장 구분
연번 | 1.000 | -0.047 | 0.230
연배출예상량(킬로그램) | -0.047 | 1.000 | 0.354
사업장 구분 | 0.230 | 0.354 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 상호 | 사업장 주소(도로명) | 사업장 구분 | 연배출예상량(킬로그램)
0 | 1 | 솔뫼초등학교 | 경기도 의정부시 시민로 442 (용현동_ 솔뫼초등학교) | 집단급식소 | 9120
1 | 2 | 3.3보리밥뷔페 | 경기도 의정부시 태평로73번길 20 (의정부동_ 의정부제일시장) | 일반음식점 | 23160
2 | 3 | 동암초등학교 | 경기도 의정부시 장곡로 206 (장암동_ 동암초등학교) | 집단급식소 | 25800
3 | 4 | 의정부효자초등학교 | 경기도 의정부시 부용로204번길 16 (신곡동_ 효자초등학교) | 집단급식소 | 15600
4 | 5 | 항도일식 | 경기도 의정부시 의정로46번길 33 (의정부동) | 일반음식점 | 13200
5 | 6 | 청해수산회천국 | 경기도 의정부시 범골로 96_ 2층 (의정부동) | 일반음식점 | 36000
6 | 7 | 노바웨딩홀 | 경기도 의정부시 둔야로 11 (의정부동) | 일반음식점 | 4800
7 | 8 | 신곡추동회관 | 경기도 의정부시 능곡로26번길 26 (신곡동) | 일반음식점 | 19080
8 | 9 | 의정부중앙초등학교 | 경기도 의정부시 호국로1291번길 17 (의정부동_ 중앙초등학교) | 집단급식소 | 11640
9 | 10 | 의료법인 영동의료재단 | 경기도 의정부시 금신로 322 (신곡동_ 의정부백병원) | 집단급식소 | 18000

Last rows
Index | 연번 | 상호 | 사업장 주소(도로명) | 사업장 구분 | 연배출예상량(킬로그램)
310 | 311 | 곰서방한식뷔페 | 경기도 의정부시 시민로 49_ B동 101호 (가능동_ 신동아파라디움) | 일반음식점 | 18000
311 | 312 | 홍천사랑말 한우식당 의정부점 | 경기도 의정부시 평화로 603 (의정부동) | 일반음식점 | 9000
312 | 313 | 고수 | 경기도 의정부시 평화로 252 (호원동) | 일반음식점 | 21600
313 | 314 | 부용중학교 | 경기도 의정부시 오목로 86 (민락동_ 부용중학교) | 집단급식소 | 48000
314 | 315 | 스시히로미 | 경기도 의정부시 발곡로 24 (신곡동) | 일반음식점 | 7200
315 | 316 | 채선당플러스(민락점) | 경기도 의정부시 오목로225번길 140 (민락동) | 일반음식점 | 21600
316 | 317 | 외양간한우 | 경기도 의정부시 고산로 177 (고산동) | 일반음식점 | 9360
317 | 318 | 국제크리스천학교 | 경기도 의정부시 진등로 28 (녹양동) | 집단급식소 | 7200
318 | 319 | (주)포유에프앤비호원 | 경기도 의정부시 평화로 400 (호원동) | 일반음식점 | 13200
319 | 320 | 고기먹는날 | 경기도 의정부시 평화로 278 (호원동) | 일반음식점 | 14400