Overview

Dataset statistics

Number of variables: 5
Number of observations: 287
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.6 KiB
Average record size in memory: 41.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

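Description: Data on industrial (business-site) waste treatment companies in Hwaseong City. It contains the serial number (연번), treatment business type (처리업종), business name (업소명), road-name address (소재지 도로명주소), and telephone number (전화번호).
Author: Hwaseong City, Gyeonggi Province (경기도 화성시)
URL: https://www.data.go.kr/data/3046882/fileData.do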

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 17:41:02.350781
Analysis finished: 2023-12-12 17:41:02.843881
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 287
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 144
Minimum: 1
Maximum: 287
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.3
Q1: 72.5
Median: 144
Q3: 215.5
95-th percentile: 272.7
Maximum: 287
Range: 286
Interquartile range (IQR): 143
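
The quantile figures above are consistent with linear interpolation over the integers 1 to 287. The following is a minimal sketch that reproduces them with the Python standard library. It assumes the 연번 column is exactly the sequence 1..287, which the report suggests (unique, minimum 1, maximum 287) but does not state outright, and it is not necessarily how the profiling tool computes its values.

```python
import statistics

# Assumption: 연번 is exactly the integer sequence 1..287.
serial_numbers = list(range(1, 288))

# method="inclusive" interpolates linearly between the smallest and largest value.
q1, median, q3 = statistics.quantiles(serial_numbers, n=4, method="inclusive")

# Cutting into 20 groups gives the 5th percentile first and the 95th last.
twentieths = statistics.quantiles(serial_numbers, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(serial_numbers) - min(serial_numbers)

print("Q1, median, Q3:", q1, median, q3)
print("5th and 95th percentile:", p5, p95)
print("IQR and range:", iqr, value_range)
```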

Descriptive statistics

Standard deviation: 82.993976
Coefficient of variation (CV): 0.57634705
Kurtosis: -1.2
Mean: 144
Median Absolute Deviation (MAD): 72
Skewness: 0
Sum: 41328
Variance: 6888
Monotonicity: Strictly increasing
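
As a cross-check, the descriptive statistics above can be recomputed by hand. This sketch again assumes that 연번 is the sequence 1..287. It uses the sample variance (divisor n - 1), which is the convention that matches the reported value of 6888.

```python
import statistics

# Assumption: 연번 is exactly the integer sequence 1..287.
serial_numbers = list(range(1, 288))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance, divisor n - 1
std_dev = statistics.stdev(serial_numbers)      # square root of the sample variance
cv = std_dev / mean                             # coefficient of variation

# Median absolute deviation: median of the distances from the median.
median = statistics.median(serial_numbers)
mad = statistics.median([abs(x - median) for x in serial_numbers])

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```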
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
197 | 1 | 0.3%
196 | 1 | 0.3%
195 | 1 | 0.3%
194 | 1 | 0.3%
193 | 1 | 0.3%
192 | 1 | 0.3%
191 | 1 | 0.3%
190 | 1 | 0.3%
Other values (277) | 277 | 96.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values
Value | Count | Frequency (%)
287 | 1 | 0.3%
286 | 1 | 0.3%
285 | 1 | 0.3%
284 | 1 | 0.3%
283 | 1 | 0.3%
282 | 1 | 0.3%
281 | 1 | 0.3%
280 | 1 | 0.3%
279 | 1 | 0.3%
278 | 1 | 0.3%

처리업종
Categorical

Distinct: 8
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

사업장배출: 119
사업장배출시설계: 88
사업장생활: 55
생활운반: 8
사업장생활계: 7
Other values (3): 10

Length

Max length: 11
Median length: 5
Mean length: 6.1045296
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 생활운반
2nd row: 사업장배출
3rd row: 생활운반
4th row: 사업장배출
5th row: 생활운반

Common Values

Value | Count | Frequency (%)
사업장배출 | 119 | 41.5%
사업장배출시설계 | 88 | 30.7%
사업장생활 | 55 | 19.2%
생활운반 | 8 | 2.8%
사업장생활계 | 7 | 2.4%
사업장배출시설계폐기물 | 5 | 1.7%
사업장배출시설계페기물 | 3 | 1.0%
사업장배출시설꼐 | 2 | 0.7%
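
Two of the eight labels above look like spelling variants of other labels (페기물 for 폐기물, and 시설꼐 for 시설계). The sketch below shows how the counts would change if they were merged before analysis. The mapping is hypothetical and rests on the assumption that these two labels are typos, which should be confirmed with the data publisher. No other categories are merged.

```python
from collections import Counter

# Counts exactly as reported in the Common Values table.
reported_counts = Counter({
    "사업장배출": 119,
    "사업장배출시설계": 88,
    "사업장생활": 55,
    "생활운반": 8,
    "사업장생활계": 7,
    "사업장배출시설계폐기물": 5,
    "사업장배출시설계페기물": 3,
    "사업장배출시설꼐": 2,
})

# Hypothetical mapping: ASSUMES these two labels are misspellings.
SPELLING_FIXES = {
    "사업장배출시설계페기물": "사업장배출시설계폐기물",  # 페 -> 폐
    "사업장배출시설꼐": "사업장배출시설계",  # 꼐 -> 계
}


def normalize_category(label: str) -> str:
    """Trim whitespace and apply the assumed spelling fixes."""
    cleaned = label.strip()
    return SPELLING_FIXES.get(cleaned, cleaned)


merged = Counter()
for label, count in reported_counts.items():
    merged[normalize_category(label)] += count

for label, count in merged.most_common():
    print(label, count)
```

Under that assumption the column would have 6 distinct values instead of 8, with the total of 287 rows unchanged.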

Length

Histogram of lengths of the category


업소명
Text

Distinct: 262
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 12
Median length: 11
Mean length: 5.2055749
Min length: 2

Characters and Unicode

Total characters: 1494
Distinct characters: 216
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 238
Unique (%): 82.9%

Sample

1st row: ㈜평촌
2nd row: ㈜태산환경
3rd row: ㈜화성
4th row: 바다환경㈜
5th row: 향원실업㈜

Words
Value | Count | Frequency (%)
주식회사 | 6 | 2.0%
바다환경㈜ | 3 | 1.0%
㈜한양실업 | 2 | 0.7%
㈜삼보운수 | 2 | 0.7%
화성지점 | 2 | 0.7%
㈜대우자원 | 2 | 0.7%
꿈에그린㈜ | 2 | 0.7%
주)은호이앤티 | 2 | 0.7%
신화자원 | 2 | 0.7%
미래환경 | 2 | 0.7%
Other values (254) | 271 | 91.6%
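
The names above write the corporate form in several ways, including the single symbol ㈜ and the spelled-out 주식회사. One possible preprocessing step before comparing or deduplicating names is Unicode NFKC normalization, which expands ㈜ into the three characters (주). The helper name below is hypothetical, and the sketch deliberately leaves 주식회사 untouched, since treating it as equivalent to (주) would be an additional assumption.

```python
import unicodedata


def normalize_company_name(name: str) -> str:
    """Hypothetical helper: NFKC-normalize and collapse runs of whitespace."""
    normalized = unicodedata.normalize("NFKC", name)
    return " ".join(normalized.split())


samples = ["㈜평촌", "바다환경㈜", "주식회사 대진환경"]
for original in samples:
    print(original, "->", normalize_company_name(original))
```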

Most occurring characters

ValueCountFrequency (%)
144
 
9.6%
98
 
6.6%
93
 
6.2%
65
 
4.4%
53
 
3.5%
36
 
2.4%
35
 
2.3%
34
 
2.3%
32
 
2.1%
31
 
2.1%
Other values (206) 873
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1317
88.2%
Other Symbol 144
 
9.6%
Space Separator 12
 
0.8%
Uppercase Letter 8
 
0.5%
Close Punctuation 5
 
0.3%
Open Punctuation 5
 
0.3%
Dash Punctuation 2
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
 
7.4%
93
 
7.1%
65
 
4.9%
53
 
4.0%
36
 
2.7%
35
 
2.7%
34
 
2.6%
32
 
2.4%
31
 
2.4%
30
 
2.3%
Other values (195) 810
61.5%
Uppercase Letter
ValueCountFrequency (%)
M 2
25.0%
T 2
25.0%
K 2
25.0%
S 1
12.5%
H 1
12.5%
Other Symbol
ValueCountFrequency (%)
144
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1461
97.8%
Common 25
 
1.7%
Latin 8
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
144
 
9.9%
98
 
6.7%
93
 
6.4%
65
 
4.4%
53
 
3.6%
36
 
2.5%
35
 
2.4%
34
 
2.3%
32
 
2.2%
31
 
2.1%
Other values (196) 840
57.5%
Common
ValueCountFrequency (%)
12
48.0%
) 5
20.0%
( 5
20.0%
- 2
 
8.0%
& 1
 
4.0%
Latin
ValueCountFrequency (%)
M 2
25.0%
T 2
25.0%
K 2
25.0%
S 1
12.5%
H 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1317
88.2%
None 144
 
9.6%
ASCII 33
 
2.2%

Most frequent character per block

None
ValueCountFrequency (%)
144
100.0%
Hangul
ValueCountFrequency (%)
98
 
7.4%
93
 
7.1%
65
 
4.9%
53
 
4.0%
36
 
2.7%
35
 
2.7%
34
 
2.6%
32
 
2.4%
31
 
2.4%
30
 
2.3%
Other values (195) 810
61.5%
ASCII
ValueCountFrequency (%)
12
36.4%
) 5
15.2%
( 5
15.2%
- 2
 
6.1%
M 2
 
6.1%
T 2
 
6.1%
K 2
 
6.1%
S 1
 
3.0%
H 1
 
3.0%
& 1
 
3.0%

소재지 도로명주소
Text

Distinct: 265
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 67
Median length: 47
Mean length: 28.181185
Min length: 7

Characters and Unicode

Total characters: 8088
Distinct characters: 215
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 248
Unique (%): 86.4%

Sample

1st row: 경기도 화성시 향남읍 한두골길 21-53
2nd row: 경기도 화성시 효행로 1011, 2층 (진안동, 명성빌딩)
3rd row: 경기도 화성시 봉담읍 분천길95번길 88
4th row: 경기도 화성시 봉담읍 왕림북길 14, 2층
5th row: 경기도 화성시 장안면 3.1만세로322번길 41

Words
Value | Count | Frequency (%)
화성시 | 283 | 18.5%
경기도 | 281 | 18.4%
팔탄면 | 44 | 2.9%
향남읍 | 31 | 2.0%
정남면 | 22 | 1.4%
봉담읍 | 22 | 1.4%
2층 | 20 | 1.3%
마도면 | 18 | 1.2%
양감면 | 16 | 1.0%
장안면 | 15 | 1.0%
Other values (505) | 774 | 50.7%

Most occurring characters

ValueCountFrequency (%)
2444
30.2%
308
 
3.8%
300
 
3.7%
295
 
3.6%
294
 
3.6%
291
 
3.6%
1 287
 
3.5%
287
 
3.5%
2 210
 
2.6%
203
 
2.5%
Other values (205) 3169
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3932
48.6%
Space Separator 2444
30.2%
Decimal Number 1387
 
17.1%
Dash Punctuation 134
 
1.7%
Other Punctuation 93
 
1.1%
Close Punctuation 44
 
0.5%
Open Punctuation 44
 
0.5%
Uppercase Letter 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
308
 
7.8%
300
 
7.6%
295
 
7.5%
294
 
7.5%
291
 
7.4%
287
 
7.3%
203
 
5.2%
141
 
3.6%
139
 
3.5%
101
 
2.6%
Other values (186) 1573
40.0%
Decimal Number
ValueCountFrequency (%)
1 287
20.7%
2 210
15.1%
3 149
10.7%
0 146
10.5%
4 127
9.2%
5 111
 
8.0%
6 107
 
7.7%
9 85
 
6.1%
7 85
 
6.1%
8 80
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
B 8
80.0%
A 1
 
10.0%
C 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 88
94.6%
. 5
 
5.4%
Space Separator
ValueCountFrequency (%)
2444
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 134
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4146
51.3%
Hangul 3932
48.6%
Latin 10
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
308
 
7.8%
300
 
7.6%
295
 
7.5%
294
 
7.5%
291
 
7.4%
287
 
7.3%
203
 
5.2%
141
 
3.6%
139
 
3.5%
101
 
2.6%
Other values (186) 1573
40.0%
Common
ValueCountFrequency (%)
2444
58.9%
1 287
 
6.9%
2 210
 
5.1%
3 149
 
3.6%
0 146
 
3.5%
- 134
 
3.2%
4 127
 
3.1%
5 111
 
2.7%
6 107
 
2.6%
, 88
 
2.1%
Other values (6) 343
 
8.3%
Latin
ValueCountFrequency (%)
B 8
80.0%
A 1
 
10.0%
C 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4156
51.4%
Hangul 3932
48.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2444
58.8%
1 287
 
6.9%
2 210
 
5.1%
3 149
 
3.6%
0 146
 
3.5%
- 134
 
3.2%
4 127
 
3.1%
5 111
 
2.7%
6 107
 
2.6%
, 88
 
2.1%
Other values (9) 353
 
8.5%
Hangul
ValueCountFrequency (%)
308
 
7.8%
300
 
7.6%
295
 
7.5%
294
 
7.5%
291
 
7.4%
287
 
7.3%
203
 
5.2%
141
 
3.6%
139
 
3.5%
101
 
2.6%
Other values (186) 1573
40.0%

전화번호
Text

Distinct: 193
Distinct (%): 67.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 16
Median length: 12
Mean length: 10.867596
Min length: 7

Characters and Unicode

Total characters: 3119
Distinct characters: 19
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 176
Unique (%): 61.3%

Sample

1st row: 031-353-6263
2nd row: 031-226-7578
3rd row: 031-227-5157
4th row: 031-291-2029
5th row: 031-351-3223

Words
Value | Count | Frequency (%)
데이터 | 78 | 21.4%
미집계 | 78 | 21.4%
031-372-2191 | 3 | 0.8%
031-366-8432 | 2 | 0.5%
031-359-8491 | 2 | 0.5%
031-295-3079 | 2 | 0.5%
031-353-8013 | 2 | 0.5%
031-225-0890 | 2 | 0.5%
031-298-0388 | 2 | 0.5%
031-233-2537 | 2 | 0.5%
Other values (179) | 192 | 52.6%
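
The words 데이터 and 미집계 each occur 78 times because 78 rows hold the placeholder text 데이터 미집계 ("data not collected") instead of a phone number. The report therefore shows 0 missing values for this column even though roughly 27% of rows carry no number. A minimal sketch of converting the placeholder to a real missing value follows. The helper name is hypothetical, and the input list is the first ten 전화번호 values from the Sample section of this report.

```python
from typing import Optional

PLACEHOLDER = "데이터 미집계"  # literal placeholder text seen in the report


def clean_phone(value: str) -> Optional[str]:
    """Hypothetical helper: return None for the placeholder, else the trimmed value."""
    text = value.strip()
    return None if text == PLACEHOLDER else text


# First ten 전화번호 values, copied from the Sample section.
first_rows = [
    "031-353-6263", "031-226-7578", "031-227-5157", "031-291-2029",
    "031-351-3223", "데이터 미집계", "031-353-7333", "031-354-4411",
    "031-221-8676", "데이터 미집계",
]

cleaned = [clean_phone(v) for v in first_rows]
missing = sum(v is None for v in cleaned)
print(f"{missing} of {len(cleaned)} sample rows have no phone number")

# Share of placeholder rows in the full dataset, from the counts in the report.
placeholder_share = 78 / 287
print(f"{placeholder_share:.1%} of all rows use the placeholder")
```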

Most occurring characters

ValueCountFrequency (%)
3 476
15.3%
- 418
13.4%
0 320
10.3%
1 310
9.9%
2 196
 
6.3%
5 190
 
6.1%
8 140
 
4.5%
135
 
4.3%
6 132
 
4.2%
7 126
 
4.0%
Other values (9) 676
21.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2097 | 67.2%
Other Letter | 468 | 15.0%
Dash Punctuation | 418 | 13.4%
Space Separator | 135 | 4.3%
Math Symbol | 1 | < 0.1%
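
The category names in this table correspond to Unicode general categories: Decimal Number is Nd, Other Letter is Lo, Dash Punctuation is Pd, Space Separator is Zs, and Math Symbol is Sm. The sketch below counts them for two values from this column with the standard library's unicodedata module. It illustrates the classification only and is not necessarily how the profiling tool computes its counts. It also shows where the 468 Other Letter characters come from: the placeholder 데이터 미집계 contains 6 Hangul letters and appears 78 times.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the Unicode general category code of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


phone_counts = category_counts("031-353-6263")
placeholder_counts = category_counts("데이터 미집계")

print(phone_counts)        # digits (Nd) and hyphens (Pd)
print(placeholder_counts)  # Hangul letters (Lo) and one space (Zs)

# Other Letter characters contributed by 78 placeholder rows.
print(placeholder_counts["Lo"] * 78)
```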

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 476
22.7%
0 320
15.3%
1 310
14.8%
2 196
9.3%
5 190
 
9.1%
8 140
 
6.7%
6 132
 
6.3%
7 126
 
6.0%
9 111
 
5.3%
4 96
 
4.6%
Other Letter
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 418
100.0%
Space Separator
ValueCountFrequency (%)
135
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2651
85.0%
Hangul 468
 
15.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 476
18.0%
- 418
15.8%
0 320
12.1%
1 310
11.7%
2 196
7.4%
5 190
 
7.2%
8 140
 
5.3%
135
 
5.1%
6 132
 
5.0%
7 126
 
4.8%
Other values (3) 208
7.8%
Hangul
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2651
85.0%
Hangul 468
 
15.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 476
18.0%
- 418
15.8%
0 320
12.1%
1 310
11.7%
2 196
7.4%
5 190
 
7.2%
8 140
 
5.3%
135
 
5.1%
6 132
 
5.0%
7 126
 
4.8%
Other values (3) 208
7.8%
Hangul
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%

Interactions


Correlations

 | 연번 | 처리업종
연번 | 1.000 | 0.667
처리업종 | 0.667 | 1.000

 | 연번 | 처리업종
연번 | 1.000 | 0.396
처리업종 | 0.396 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 처리업종 | 업소명 | 소재지 도로명주소 | 전화번호
0 | 1 | 생활운반 | ㈜평촌 | 경기도 화성시 향남읍 한두골길 21-53 | 031-353-6263
1 | 2 | 사업장배출 | ㈜태산환경 | 경기도 화성시 효행로 1011, 2층 (진안동, 명성빌딩) | 031-226-7578
2 | 3 | 생활운반 | ㈜화성 | 경기도 화성시 봉담읍 분천길95번길 88 | 031-227-5157
3 | 4 | 사업장배출 | 바다환경㈜ | 경기도 화성시 봉담읍 왕림북길 14, 2층 | 031-291-2029
4 | 5 | 생활운반 | 향원실업㈜ | 경기도 화성시 장안면 3.1만세로322번길 41 | 031-351-3223
5 | 6 | 사업장배출 | 경희환경 | 경기도 화성시 향남읍 향남로 416 상도프라자 703호 | 데이터 미집계
6 | 7 | 생활운반 | ㈜원천환경 | 경기도 화성시 향남읍 만년로151번길 62 | 031-353-7333
7 | 8 | 사업장배출 | 우진환경 | 경기도 화성시 양감면 발안로 840 | 031-354-4411
8 | 9 | 사업장배출 | 조은환경㈜ | 경기도 화성시 효행로 512 (안녕동) | 031-221-8676
9 | 10 | 사업장배출 | ㈜에버그린 | 경기도 화성시 떡전골로 112-13,3층(진안동, 경원빌딩) | 데이터 미집계

Last rows
Index | 연번 | 처리업종 | 업소명 | 소재지 도로명주소 | 전화번호
277 | 278 | 사업장배출시설계 | 이엘테크 | 경기도 화성시 정남면 내향로 240-67 | 데이터 미집계
278 | 279 | 사업장배출시설계 | 진성수지 | 경기도 화성시 마도면 마도로 334-5 | 데이터 미집계
279 | 280 | 사업장배출시설계 | 현민자원 | 경기도 화성시 팔탄면 삼천병마로 355-19 | 데이터 미집계
280 | 281 | 사업장생활계 | ㈜평화알씨 | 경기도 화성시 동탄기흥로 590, B동 922호 | 031-374-7842
281 | 282 | 사업장배출시설계 | 다원자원 | 경기도 화성시 우정읍 원금의길 126, 1층 | 데이터 미집계
282 | 283 | 사업장배출시설계 | ㈜케이제이이알에스 | 데이터 미집계 | 데이터 미집계
283 | 284 | 사업장생활계 | 주식회사 대진환경 | 경기도 화성시 향남읍 하길로9, 1109동 1203호 | 031-354-9307
284 | 285 | 사업장배출시설계 | 주식회사 대진환경 | 경기도 화성시 향남읍 하길로9, 1109동 1203호 | 031-354-9307
285 | 286 | 사업장배출시설계 | 부림상사 | 경기도 화성시 팔탄면 율암리 830-1 | 데이터 미집계
286 | 287 | 사업장배출시설계 | 송산산업㈜ | 경기도 화성시 송산면 송산포도로 275-46 | 데이터 미집계