Overview

Dataset statistics

Number of variables5
Number of observations206
Missing cells7
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.4 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상남도 내 건설폐기물 처리업체(수집운반업 및 중간처분업)에 관한 현황입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3084077

Alerts

연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
전화번호 has 7 (3.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:27:40.822544
Analysis finished2023-12-11 00:27:41.364650
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct206
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103.5
Minimum1
Maximum206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-11T09:27:41.457862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.25
Q152.25
median103.5
Q3154.75
95-th percentile195.75
Maximum206
Range205
Interquartile range (IQR)102.5

Descriptive statistics

Standard deviation59.611241
Coefficient of variation (CV)0.57595401
Kurtosis-1.2
Mean103.5
Median Absolute Deviation (MAD)51.5
Skewness0
Sum21321
Variance3553.5
MonotonicityStrictly increasing
2023-12-11T09:27:41.588052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
143 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
137 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
140 1
 
0.5%
Other values (196) 196
95.1%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
206 1
0.5%
205 1
0.5%
204 1
0.5%
203 1
0.5%
202 1
0.5%
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
수집운반업
144 
중간처분업
62 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수집운반업
2nd row수집운반업
3rd row수집운반업
4th row수집운반업
5th row수집운반업

Common Values

ValueCountFrequency (%)
수집운반업 144
69.9%
중간처분업 62
30.1%

Length

2023-12-11T09:27:41.708457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:27:41.809027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수집운반업 144
69.9%
중간처분업 62
30.1%
Distinct158
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T09:27:42.010875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length7.9660194
Min length3

Characters and Unicode

Total characters1641
Distinct characters164
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)53.4%

Sample

1st row현대환경개발(주)
2nd row(주)현대자원
3rd row명성환경
4th row(주)아시아환경
5th row동광골재환경
ValueCountFrequency (%)
주식회사 10
 
4.5%
현대환경개발(주 2
 
0.9%
원지지점(건설 2
 
0.9%
주)상원엔텍 2
 
0.9%
태성개발(주 2
 
0.9%
한맥(주)-밀양 2
 
0.9%
주)이엔에프(밀양 2
 
0.9%
주)정우개발 2
 
0.9%
한통아스콘(주 2
 
0.9%
금광개발(주 2
 
0.9%
Other values (154) 194
87.4%
2023-12-11T09:27:42.417518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 189
 
11.5%
( 188
 
11.5%
166
 
10.1%
92
 
5.6%
85
 
5.2%
60
 
3.7%
56
 
3.4%
25
 
1.5%
24
 
1.5%
24
 
1.5%
Other values (154) 732
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1240
75.6%
Close Punctuation 189
 
11.5%
Open Punctuation 188
 
11.5%
Space Separator 16
 
1.0%
Dash Punctuation 8
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
13.4%
92
 
7.4%
85
 
6.9%
60
 
4.8%
56
 
4.5%
25
 
2.0%
24
 
1.9%
24
 
1.9%
22
 
1.8%
22
 
1.8%
Other values (150) 664
53.5%
Close Punctuation
ValueCountFrequency (%)
) 189
100.0%
Open Punctuation
ValueCountFrequency (%)
( 188
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1240
75.6%
Common 401
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
13.4%
92
 
7.4%
85
 
6.9%
60
 
4.8%
56
 
4.5%
25
 
2.0%
24
 
1.9%
24
 
1.9%
22
 
1.8%
22
 
1.8%
Other values (150) 664
53.5%
Common
ValueCountFrequency (%)
) 189
47.1%
( 188
46.9%
16
 
4.0%
- 8
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1240
75.6%
ASCII 401
 
24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 189
47.1%
( 188
46.9%
16
 
4.0%
- 8
 
2.0%
Hangul
ValueCountFrequency (%)
166
 
13.4%
92
 
7.4%
85
 
6.9%
60
 
4.8%
56
 
4.5%
25
 
2.0%
24
 
1.9%
24
 
1.9%
22
 
1.8%
22
 
1.8%
Other values (150) 664
53.5%
Distinct168
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T09:27:42.759776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length41
Mean length27.81068
Min length19

Characters and Unicode

Total characters5729
Distinct characters225
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique130 ?
Unique (%)63.1%

Sample

1st row경상남도 창원시 의창구 북면 천주로 991-22
2nd row경상남도 창원시 마산회원구 내서읍 중리상곡로 104 ,603호(쌍봉빌딩)
3rd row경상남도 창원시 진해구 충장로 147 (여좌동)
4th row경상남도 창원시 마산회원구 북성로 313 , 202-1호 (회성동)
5th row경상남도 창원시 마산합포구 무학로 480 (교방동,동광골재)
ValueCountFrequency (%)
경상남도 207
 
17.4%
창원시 36
 
3.0%
김해시 30
 
2.5%
양산시 21
 
1.8%
14
 
1.2%
진주시 13
 
1.1%
한림면 13
 
1.1%
마산회원구 12
 
1.0%
함안군 12
 
1.0%
밀양시 11
 
0.9%
Other values (477) 820
69.0%
2023-12-11T09:27:43.291240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1043
 
18.2%
241
 
4.2%
227
 
4.0%
227
 
4.0%
226
 
3.9%
1 183
 
3.2%
2 149
 
2.6%
145
 
2.5%
120
 
2.1%
110
 
1.9%
Other values (215) 3058
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3422
59.7%
Space Separator 1043
 
18.2%
Decimal Number 910
 
15.9%
Dash Punctuation 107
 
1.9%
Open Punctuation 104
 
1.8%
Close Punctuation 104
 
1.8%
Other Punctuation 33
 
0.6%
Other Symbol 2
 
< 0.1%
Uppercase Letter 2
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
241
 
7.0%
227
 
6.6%
227
 
6.6%
226
 
6.6%
145
 
4.2%
120
 
3.5%
110
 
3.2%
102
 
3.0%
96
 
2.8%
84
 
2.5%
Other values (194) 1844
53.9%
Decimal Number
ValueCountFrequency (%)
1 183
20.1%
2 149
16.4%
3 106
11.6%
0 92
10.1%
6 81
8.9%
7 73
 
8.0%
5 66
 
7.3%
4 60
 
6.6%
9 51
 
5.6%
8 49
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 28
84.8%
. 5
 
15.2%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%
Space Separator
ValueCountFrequency (%)
1043
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 107
100.0%
Open Punctuation
ValueCountFrequency (%)
( 104
100.0%
Close Punctuation
ValueCountFrequency (%)
) 104
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3422
59.7%
Common 2305
40.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
241
 
7.0%
227
 
6.6%
227
 
6.6%
226
 
6.6%
145
 
4.2%
120
 
3.5%
110
 
3.2%
102
 
3.0%
96
 
2.8%
84
 
2.5%
Other values (194) 1844
53.9%
Common
ValueCountFrequency (%)
1043
45.2%
1 183
 
7.9%
2 149
 
6.5%
- 107
 
4.6%
3 106
 
4.6%
( 104
 
4.5%
) 104
 
4.5%
0 92
 
4.0%
6 81
 
3.5%
7 73
 
3.2%
Other values (9) 263
 
11.4%
Latin
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3422
59.7%
ASCII 2305
40.2%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1043
45.2%
1 183
 
7.9%
2 149
 
6.5%
- 107
 
4.6%
3 106
 
4.6%
( 104
 
4.5%
) 104
 
4.5%
0 92
 
4.0%
6 81
 
3.5%
7 73
 
3.2%
Other values (10) 263
 
11.4%
Hangul
ValueCountFrequency (%)
241
 
7.0%
227
 
6.6%
227
 
6.6%
226
 
6.6%
145
 
4.2%
120
 
3.5%
110
 
3.2%
102
 
3.0%
96
 
2.8%
84
 
2.5%
Other values (194) 1844
53.9%
CJK Compat
ValueCountFrequency (%)
2
100.0%

전화번호
Text

MISSING 

Distinct150
Distinct (%)75.4%
Missing7
Missing (%)3.4%
Memory size1.7 KiB
2023-12-11T09:27:43.549917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters2388
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)50.8%

Sample

1st row031-998-7881
2nd row055-231-6002
3rd row033-635-5022
4th row055-251-9993
5th row055-247-3066
ValueCountFrequency (%)
055-583-5885 2
 
1.0%
055-391-6500 2
 
1.0%
055-326-9123 2
 
1.0%
055-649-2813 2
 
1.0%
055-346-1100 2
 
1.0%
055-587-3737 2
 
1.0%
055-973-8512 2
 
1.0%
055-745-3327 2
 
1.0%
055-758-0028 2
 
1.0%
055-744-4031 2
 
1.0%
Other values (140) 179
89.9%
2023-12-11T09:27:43.943716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 549
23.0%
- 398
16.7%
0 345
14.4%
3 190
 
8.0%
2 165
 
6.9%
8 146
 
6.1%
4 137
 
5.7%
7 124
 
5.2%
6 123
 
5.2%
9 106
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1990
83.3%
Dash Punctuation 398
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 549
27.6%
0 345
17.3%
3 190
 
9.5%
2 165
 
8.3%
8 146
 
7.3%
4 137
 
6.9%
7 124
 
6.2%
6 123
 
6.2%
9 106
 
5.3%
1 105
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 398
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2388
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 549
23.0%
- 398
16.7%
0 345
14.4%
3 190
 
8.0%
2 165
 
6.9%
8 146
 
6.1%
4 137
 
5.7%
7 124
 
5.2%
6 123
 
5.2%
9 106
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2388
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 549
23.0%
- 398
16.7%
0 345
14.4%
3 190
 
8.0%
2 165
 
6.9%
8 146
 
6.1%
4 137
 
5.7%
7 124
 
5.2%
6 123
 
5.2%
9 106
 
4.4%

Interactions

2023-12-11T09:27:41.106731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:27:44.025418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0001.000
구분1.0001.000
2023-12-11T09:27:44.339249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.980
구분0.9801.000

Missing values

2023-12-11T09:27:41.221855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:27:41.320930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분업체명소재지전화번호
01수집운반업현대환경개발(주)경상남도 창원시 의창구 북면 천주로 991-22031-998-7881
12수집운반업(주)현대자원경상남도 창원시 마산회원구 내서읍 중리상곡로 104 ,603호(쌍봉빌딩)055-231-6002
23수집운반업명성환경경상남도 창원시 진해구 충장로 147 (여좌동)033-635-5022
34수집운반업(주)아시아환경경상남도 창원시 마산회원구 북성로 313 , 202-1호 (회성동)055-251-9993
45수집운반업동광골재환경경상남도 창원시 마산합포구 무학로 480 (교방동,동광골재)055-247-3066
56수집운반업정우환경개발(창원)경상남도 창원시 소계동 창원시 소계동 130-200번지055-298-2662
67수집운반업완월환경골재경상남도 마산시 완월동 229-1055-248-5220
78수집운반업(주)대득건설(운반자)경상남도 창원시 의창구 도계두리길 108-11 (도계동)055-277-0306
89수집운반업형제골재환경경상남도 창원시 동정동 222-3055-252-0606
910수집운반업푸른환경개발(마산)경상남도 창원시 마산회원구 무학로 562 (회원동)055-243-4343
연번구분업체명소재지전화번호
196197중간처분업(주)동영환경경상남도 산청군 오부면 산수로 424-43055-973-8512
197198중간처분업일신환경(주)(산청)경상남도 산청군 신안면 하정리 457055-973-7500
198199중간처분업(주)승안환경경상남도 함양군 수동면 구라길 95 (주)승안환경055-964-4000
199200중간처분업(주)은창환경경상남도 함양군 휴천면 목현옥매로 209-1 (주)은창환경_처리장055-962-5600
200201중간처분업남양기업(주)경상남도 함양군 휴천면 호산리 680번지055-962-5580
201202중간처분업케이디산업주식회사경상남도 함양군 유림면 목현옥매로 306 (옥매리)055-964-7890
202203중간처분업(주)한국크락샤경상남도 거창군 위천면 원당2길 150055-945-0667
203204중간처분업(주)상원엔텍경상남도 합천군 율곡면 황강옥전로 292-3 상원엔텍055-932-9200
204205중간처분업(주)성지이테크경상남도 합천군 율곡면 두사리 300번지053-631-3999
205206중간처분업(주)초계산업경상남도 합천군 적중면 황강옥전로 1861055-931-7844