Overview

Dataset statistics

Number of variables: 5
Number of observations: 176
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.2 KiB
Average record size in memory: 41.8 B
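
The missing-cell percentage follows from the counts listed here. A small sketch of that arithmetic, using only the figures from this report (the variable names are illustrative):

```python
# Figures copied from the dataset statistics in this report.
n_variables = 5
n_observations = 176
missing_cells = 1

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

With one missing cell among 880, the share rounds to the 0.1% shown in the report.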

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

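Description: Current status of construction waste treatment businesses (collection and transport, and intermediate disposal) in Gyeongsangnam-do.
Author: Gyeongsangnam-do (경상남도)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3084077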

Alerts

연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 00:27:45.517519
Analysis finished: 2023-12-11 00:27:46.094401
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 176
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 88.5
Minimum: 1
Maximum: 176
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 9.75
Q1: 44.75
Median: 88.5
Q3: 132.25
95-th percentile: 167.25
Maximum: 176
Range: 175
Interquartile range (IQR): 87.5

Descriptive statistics

Standard deviation: 50.950957
Coefficient of variation (CV): 0.57571703
Kurtosis: -1.2
Mean: 88.5
Median Absolute Deviation (MAD): 44
Skewness: 0
Sum: 15576
Variance: 2596
Monotonicity: Strictly increasing
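
Because 연번 is unique, strictly increasing, and spans 1 to 176 over 176 rows, it must be the integers 1..176, so the statistics in this section can be recomputed by hand. The sketch below does so with the standard library only. It assumes the sample variance (divisor n - 1) and linearly interpolated quantiles, which are the conventions that reproduce the reported figures; the helper name `quantile` is illustrative.

```python
import math
import statistics

values = [float(v) for v in range(1, 177)]  # 연번 runs 1..176 without gaps
n = len(values)

mean = sum(values) / n
variance = sum((v - mean) ** 2 for v in values) / (n - 1)  # sample variance
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


print(mean, variance, round(std, 6), round(cv, 8), mad)
print(quantile(values, 0.05), quantile(values, 0.25), quantile(values, 0.75))
```

For a gap-free serial number like this one, these statistics describe the row numbering rather than the businesses themselves.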
[Figure: Histogram with fixed size bins (bins=50)]
Common values
Value | Count | Frequency (%)
1 | 1 | 0.6%
90 | 1 | 0.6%
114 | 1 | 0.6%
115 | 1 | 0.6%
116 | 1 | 0.6%
117 | 1 | 0.6%
118 | 1 | 0.6%
119 | 1 | 0.6%
120 | 1 | 0.6%
121 | 1 | 0.6%
Other values (166) | 166 | 94.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%

Maximum 10 values
Value | Count | Frequency (%)
176 | 1 | 0.6%
175 | 1 | 0.6%
174 | 1 | 0.6%
173 | 1 | 0.6%
172 | 1 | 0.6%
171 | 1 | 0.6%
170 | 1 | 0.6%
169 | 1 | 0.6%
168 | 1 | 0.6%
167 | 1 | 0.6%

구분 (category)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB
수집운반업 (collection and transport): 126
중간처리업 (intermediate treatment): 50

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수집운반업
2nd row: 수집운반업
3rd row: 수집운반업
4th row: 수집운반업
5th row: 수집운반업

Common Values

Value | Count | Frequency (%)
수집운반업 | 126 | 71.6%
중간처리업 | 50 | 28.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of value counts; 수집운반업 126 (71.6%), 중간처리업 50 (28.4%)]

업소명 (business name)
Text

Distinct: 136
Distinct (%): 77.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 16
Median length: 13
Mean length: 7.9488636
Min length: 3

Characters and Unicode

Total characters: 1399
Distinct characters: 153
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
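
To illustrate what a Unicode general category is, the sketch below classifies the characters of the first five sample values of this column with Python's standard `unicodedata` module. It is not the report's own implementation. The two-letter codes map to the long names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

# First five sample values of the business-name column, as listed in this report.
sample_names = [
    "형제골재환경",
    "동광골재환경",
    "(주)진산기업",
    "아신엔지니어링(주)",
    "완월환경골재",
]

# Count the general category of every character across the sample.
category_counts = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category tables for the text columns.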

Unique

Unique: 96
Unique (%): 54.5%

Sample

1st row: 형제골재환경
2nd row: 동광골재환경
3rd row: (주)진산기업
4th row: 아신엔지니어링(주)
5th row: 완월환경골재

Value | Count | Frequency (%)
주식회사 | 4 | 2.2%
주)중앙환경(김해 | 2 | 1.1%
동건환경(주 | 2 | 1.1%
에코시스템(주 | 2 | 1.1%
주)구룡 | 2 | 1.1%
일신환경(주)(산청 | 2 | 1.1%
금광개발(주 | 2 | 1.1%
주)경부이엔티 | 2 | 1.1%
삼삼환경(주 | 2 | 1.1%
대진환경(주)(함안 | 2 | 1.1%
Other values (129) | 160 | 87.9%
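
The entries in the word table, such as 주)중앙환경(김해 and 동건환경(주, look like whitespace-separated tokens with leading and trailing punctuation stripped. The sketch below reproduces that shape on a few names from this report. It is an inference from the table, not the profiling library's code, and `word_counts` is a hypothetical helper.

```python
import string
from collections import Counter


def word_counts(values):
    """Split on whitespace, strip ASCII punctuation from both ends, count tokens."""
    words = []
    for value in values:
        for token in value.split():
            token = token.strip(string.punctuation)
            if token:  # skip tokens that were punctuation only
                words.append(token)
    return Counter(words)


# Business names taken from the sample rows of this report.
names = ["(주)진산기업", "아신엔지니어링(주)", "주식회사 오성산업환경(운반자)"]
counts = word_counts(names)

for word, count in counts.items():
    print(word, count)
```

Stripping only the outer punctuation explains the unbalanced parentheses in the table: (주)진산기업 loses its opening parenthesis but keeps the inner closing one.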

Most occurring characters

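Value | Count | Frequency (%)
) | 169 | 12.1%
( | 168 | 12.0%
[character lost] | 137 | 9.8%
[character lost] | 86 | 6.1%
[character lost] | 81 | 5.8%
[character lost] | 50 | 3.6%
[character lost] | 48 | 3.4%
[character lost] | 22 | 1.6%
[character lost] | 22 | 1.6%
[character lost] | 17 | 1.2%
Other values (143) | 599 | 42.8%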

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1049 | 75.0%
Close Punctuation | 169 | 12.1%
Open Punctuation | 168 | 12.0%
Dash Punctuation | 7 | 0.5%
Space Separator | 6 | 0.4%

Most frequent character per category

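Other Letter
Value | Count | Frequency (%)
[character lost] | 137 | 13.1%
[character lost] | 86 | 8.2%
[character lost] | 81 | 7.7%
[character lost] | 50 | 4.8%
[character lost] | 48 | 4.6%
[character lost] | 22 | 2.1%
[character lost] | 22 | 2.1%
[character lost] | 17 | 1.6%
[character lost] | 16 | 1.5%
[character lost] | 16 | 1.5%
Other values (139) | 554 | 52.8%

Close Punctuation
Value | Count | Frequency (%)
) | 169 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 168 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 6 | 100.0%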

Most occurring scripts

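Value | Count | Frequency (%)
Hangul | 1049 | 75.0%
Common | 350 | 25.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character lost] | 137 | 13.1%
[character lost] | 86 | 8.2%
[character lost] | 81 | 7.7%
[character lost] | 50 | 4.8%
[character lost] | 48 | 4.6%
[character lost] | 22 | 2.1%
[character lost] | 22 | 2.1%
[character lost] | 17 | 1.6%
[character lost] | 16 | 1.5%
[character lost] | 16 | 1.5%
Other values (139) | 554 | 52.8%

Common
Value | Count | Frequency (%)
) | 169 | 48.3%
( | 168 | 48.0%
- | 7 | 2.0%
(space) | 6 | 1.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1049 | 75.0%
ASCII | 350 | 25.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
) | 169 | 48.3%
( | 168 | 48.0%
- | 7 | 2.0%
(space) | 6 | 1.7%

Hangul
Value | Count | Frequency (%)
[character lost] | 137 | 13.1%
[character lost] | 86 | 8.2%
[character lost] | 81 | 7.7%
[character lost] | 50 | 4.8%
[character lost] | 48 | 4.6%
[character lost] | 22 | 2.1%
[character lost] | 22 | 2.1%
[character lost] | 17 | 1.6%
[character lost] | 16 | 1.5%
[character lost] | 16 | 1.5%
Other values (139) | 554 | 52.8%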

소재지 (location)
Text

Distinct: 137
Distinct (%): 77.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 49
Median length: 39
Mean length: 26.386364
Min length: 18

Characters and Unicode

Total characters: 4644
Distinct characters: 219
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 98
Unique (%): 55.7%

Sample

1st row: 경상남도 창원시 동정동 222-3
2nd row: 경상남도 창원시 마산합포구 무학로 480 (교방동,동광골재)
3rd row: 경상남도 창원시 마산합포구 교방동 365-23
4th row: 경상남도 창원시 마산합포구 산호동 383-4번지 2층 아신엔지니어링(주)
5th row: 경상남도 창원시 마산합포구 완월동 229-1

Value | Count | Frequency (%)
경상남도 | 176 | 18.1%
창원시 | 40 | 4.1%
김해시 | 26 | 2.7%
[value lost] | 16 | 1.6%
양산시 | 15 | 1.5%
마산합포구 | 13 | 1.3%
진주시 | 12 | 1.2%
밀양시 | 9 | 0.9%
거창군 | 9 | 0.9%
마산회원구 | 9 | 0.9%
Other values (375) | 649 | 66.6%

Most occurring characters

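Value | Count | Frequency (%)
(space) | 853 | 18.4%
[character lost] | 204 | 4.4%
[character lost] | 189 | 4.1%
[character lost] | 189 | 4.1%
[character lost] | 188 | 4.0%
1 | 155 | 3.3%
[character lost] | 121 | 2.6%
2 | 113 | 2.4%
[character lost] | 102 | 2.2%
- | 101 | 2.2%
Other values (209) | 2429 | 52.3%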

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2802 | 60.3%
Space Separator | 853 | 18.4%
Decimal Number | 727 | 15.7%
Dash Punctuation | 101 | 2.2%
Close Punctuation | 64 | 1.4%
Open Punctuation | 64 | 1.4%
Other Punctuation | 27 | 0.6%
Other Symbol | 2 | < 0.1%
Lowercase Letter | 2 | < 0.1%
Connector Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character lost] | 204 | 7.3%
[character lost] | 189 | 6.7%
[character lost] | 189 | 6.7%
[character lost] | 188 | 6.7%
[character lost] | 121 | 4.3%
[character lost] | 102 | 3.6%
[character lost] | 92 | 3.3%
[character lost] | 83 | 3.0%
[character lost] | 76 | 2.7%
[character lost] | 73 | 2.6%
Other values (188) | 1485 | 53.0%

Decimal Number
Value | Count | Frequency (%)
1 | 155 | 21.3%
2 | 113 | 15.5%
3 | 80 | 11.0%
0 | 72 | 9.9%
4 | 62 | 8.5%
7 | 57 | 7.8%
6 | 57 | 7.8%
5 | 54 | 7.4%
8 | 42 | 5.8%
9 | 35 | 4.8%

Other Punctuation
Value | Count | Frequency (%)
, | 18 | 66.7%
. | 9 | 33.3%

Lowercase Letter
Value | Count | Frequency (%)
l | 1 | 50.0%
e | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 853 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 101 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 64 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 64 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[character lost] | 2 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
T | 1 | 100.0%

Most occurring scripts

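Value | Count | Frequency (%)
Hangul | 2802 | 60.3%
Common | 1839 | 39.6%
Latin | 3 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character lost] | 204 | 7.3%
[character lost] | 189 | 6.7%
[character lost] | 189 | 6.7%
[character lost] | 188 | 6.7%
[character lost] | 121 | 4.3%
[character lost] | 102 | 3.6%
[character lost] | 92 | 3.3%
[character lost] | 83 | 3.0%
[character lost] | 76 | 2.7%
[character lost] | 73 | 2.6%
Other values (188) | 1485 | 53.0%

Common
Value | Count | Frequency (%)
(space) | 853 | 46.4%
1 | 155 | 8.4%
2 | 113 | 6.1%
- | 101 | 5.5%
3 | 80 | 4.4%
0 | 72 | 3.9%
) | 64 | 3.5%
( | 64 | 3.5%
4 | 62 | 3.4%
7 | 57 | 3.1%
Other values (8) | 218 | 11.9%

Latin
Value | Count | Frequency (%)
T | 1 | 33.3%
l | 1 | 33.3%
e | 1 | 33.3%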

Most occurring blocks

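Value | Count | Frequency (%)
Hangul | 2802 | 60.3%
ASCII | 1840 | 39.6%
CJK Compat | 2 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 853 | 46.4%
1 | 155 | 8.4%
2 | 113 | 6.1%
- | 101 | 5.5%
3 | 80 | 4.3%
0 | 72 | 3.9%
) | 64 | 3.5%
( | 64 | 3.5%
4 | 62 | 3.4%
7 | 57 | 3.1%
Other values (10) | 219 | 11.9%

Hangul
Value | Count | Frequency (%)
[character lost] | 204 | 7.3%
[character lost] | 189 | 6.7%
[character lost] | 189 | 6.7%
[character lost] | 188 | 6.7%
[character lost] | 121 | 4.3%
[character lost] | 102 | 3.6%
[character lost] | 92 | 3.3%
[character lost] | 83 | 3.0%
[character lost] | 76 | 2.7%
[character lost] | 73 | 2.6%
Other values (188) | 1485 | 53.0%

CJK Compat
Value | Count | Frequency (%)
[character lost] | 2 | 100.0%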

전화번호 (phone number)
Text

Distinct: 129
Distinct (%): 73.7%
Missing: 1
Missing (%): 0.6%
Memory size: 1.5 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.942857
Min length: 1

Characters and Unicode

Total characters: 2090
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 84
Unique (%): 48.0%
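
This column has one missing value, and its percentages are consistent with a denominator of 175 non-missing rows rather than all 176. The sketch below checks that arithmetic against the figures in this report. The choice of denominator is inferred from the numbers, not taken from the profiling library's source.

```python
# Figures copied from this column's statistics.
n_rows = 176
n_missing = 1
n_distinct = 129
n_unique = 84
total_characters = 2090

n_present = n_rows - n_missing  # rows that actually hold a value

distinct_pct = 100 * n_distinct / n_present
unique_pct = 100 * n_unique / n_present
mean_length = total_characters / n_present

# For comparison: the share of distinct values if all rows were counted.
distinct_pct_all_rows = 100 * n_distinct / n_rows

print(round(distinct_pct, 1), round(unique_pct, 1), round(mean_length, 6))
print(round(distinct_pct_all_rows, 1))
```

Dividing by 175 gives the reported 73.7%, 48.0%, and mean length 11.942857, whereas dividing by 176 would give 73.3% distinct.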

Sample

1st row: 055-266-7007
2nd row: 055-251-9993
3rd row: 055-271-0400
4th row: 055-223-2222
5th row: 055-247-3066
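
The sample values are all 12 characters long, yet the length statistics report a minimum of 1 and a maximum of 13, so at least one entry does not follow the sample format. A minimal sketch of a format check is shown below. The pattern (2 to 3 digits, 3 to 4 digits, 4 digits, separated by hyphens) is an assumption chosen to fit the samples, and `looks_like_phone` is a hypothetical helper.

```python
import re

# ASSUMPTION: area code of 2-3 digits, then 3-4 digits, then 4 digits.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")


def looks_like_phone(value):
    """Return True when the whole value matches the assumed phone pattern."""
    return value is not None and PHONE_PATTERN.fullmatch(value) is not None


# Sample values from this report, plus a missing value and a
# hypothetical one-character entry for illustration.
candidates = ["055-266-7007", "055-251-9993", None, "1"]
for candidate in candidates:
    print(candidate, looks_like_phone(candidate))
```

A check of this kind would flag the short entry for manual review without guessing what the correct number is.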

Value | Count | Frequency (%)
055-367-1004 | 3 | 1.7%
055-343-7755 | 2 | 1.1%
055-864-7460 | 2 | 1.1%
055-855-1311 | 2 | 1.1%
055-745-3327 | 2 | 1.1%
055-973-8512 | 2 | 1.1%
055-339-8643 | 2 | 1.1%
055-586-2288 | 2 | 1.1%
055-587-7080 | 2 | 1.1%
055-884-5942 | 2 | 1.1%
Other values (119) | 154 | 88.0%

Most occurring characters

Value | Count | Frequency (%)
5 | 480 | 23.0%
- | 349 | 16.7%
0 | 304 | 14.5%
3 | 175 | 8.4%
2 | 160 | 7.7%
8 | 118 | 5.6%
7 | 112 | 5.4%
4 | 108 | 5.2%
1 | 100 | 4.8%
6 | 97 | 4.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1741 | 83.3%
Dash Punctuation | 349 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 480 | 27.6%
0 | 304 | 17.5%
3 | 175 | 10.1%
2 | 160 | 9.2%
8 | 118 | 6.8%
7 | 112 | 6.4%
4 | 108 | 6.2%
1 | 100 | 5.7%
6 | 97 | 5.6%
9 | 87 | 5.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 349 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2090 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 480 | 23.0%
- | 349 | 16.7%
0 | 304 | 14.5%
3 | 175 | 8.4%
2 | 160 | 7.7%
8 | 118 | 5.6%
7 | 112 | 5.4%
4 | 108 | 5.2%
1 | 100 | 4.8%
6 | 97 | 4.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2090 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 480 | 23.0%
- | 349 | 16.7%
0 | 304 | 14.5%
3 | 175 | 8.4%
2 | 160 | 7.7%
8 | 118 | 5.6%
7 | 112 | 5.4%
4 | 108 | 5.2%
1 | 100 | 4.8%
6 | 97 | 4.6%

Interactions

[Figure: interaction plot; image not preserved]

Correlations

     | 연번 | 구분
연번 | 1.000 | 0.998
구분 | 0.998 | 1.000

     | 연번 | 구분
연번 | 1.000 | 0.940
구분 | 0.940 | 1.000
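
A serial number can correlate strongly with a category when the rows are listed one category after another. The sketch below illustrates this with Cramér's V computed on a binned copy of the serial number, which is how the ydata-profiling documentation describes its "auto" measure for numeric-categorical pairs. Several things here are assumptions: that this measure produced the matrices above, that the first 126 rows are 수집운반업 and the last 50 are 중간처리업 (the report confirms only the counts and the first and last ten rows), and the use of 10 equal-width bins with no bias correction. It is not expected to reproduce the reported values.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Cramér's V (no bias correction) for two equally long label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


def equal_width_bins(values, bins=10):
    """Map each number to the index of its equal-width bin."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


serials = list(range(1, 177))
# ASSUMPTION: rows are grouped by category in this order.
categories = ["수집운반업"] * 126 + ["중간처리업"] * 50

v = cramers_v(equal_width_bins(serials), categories)
print(round(v, 3))
```

Under these assumptions only one bin mixes the two categories, so the association comes out close to 1. A high value here would reflect the ordering of the file, not a property of the businesses.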

Missing values

[Figure: nullity count per column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows
(index) | 연번 | 구분 | 업소명 | 소재지 | 전화번호
0 | 1 | 수집운반업 | 형제골재환경 | 경상남도 창원시 동정동 222-3 | 055-266-7007
1 | 2 | 수집운반업 | 동광골재환경 | 경상남도 창원시 마산합포구 무학로 480 (교방동,동광골재) | 055-251-9993
2 | 3 | 수집운반업 | (주)진산기업 | 경상남도 창원시 마산합포구 교방동 365-23 | 055-271-0400
3 | 4 | 수집운반업 | 아신엔지니어링(주) | 경상남도 창원시 마산합포구 산호동 383-4번지 2층 아신엔지니어링(주) | 055-223-2222
4 | 5 | 수집운반업 | 완월환경골재 | 경상남도 창원시 마산합포구 완월동 229-1 | 055-247-3066
5 | 6 | 수집운반업 | 동일환경 | 경상남도 창원시 마산합포구 월영동 708-2 | 055-251-0358
6 | 7 | 수집운반업 | 조건사 | 경상남도 창원시 마산합포구 자산동 24-2번지 | 055-238-2477
7 | 8 | 수집운반업 | 주식회사 오성산업환경(운반자) | 경상남도 창원시 마산합포구 진동면 해양관광로 309-16 | 055-251-4585
8 | 9 | 수집운반업 | (주)청림건업 | 경상남도 창원시 마산합포구 진북면 망곡리 망곡리 13-5 | 055-210-3931
9 | 10 | 수집운반업 | (주)푸른환경(마산) | 경상남도 창원시 마산합포구 진전면 오서리 106-1 | 055-241-6733

Last rows
(index) | 연번 | 구분 | 업소명 | 소재지 | 전화번호
166 | 167 | 중간처리업 | (주)토지환경 | 경상남도 하동군 양보면 지례리 193-14 | 055-884-5942
167 | 168 | 중간처리업 | 일신환경(주)(산청) | 경상남도 산청군 신안면 하정리 457 | 055-973-8512
168 | 169 | 중간처리업 | (주)동영환경 | 경상남도 산청군 오부면 양촌리 231-1 | 055-973-7500
169 | 170 | 중간처리업 | (주)승안환경 | 경상남도 함양군 수동면 구라길 95 (주)승안환경 | 055-964-4000
170 | 171 | 중간처리업 | 남양기업(주) | 경상남도 거창군 휴천면 호산리 680번지 | 055-962-5600
171 | 172 | 중간처리업 | (주)은창환경 | 경상남도 거창군 휴천면 목현옥매로 209-1 (주)은창환경_처리장 | 055-962-5580
172 | 173 | 중간처리업 | (주)한국크락샤 | 경상남도 거창군 위천면 원당2길 150 | 055-945-0667
173 | 174 | 중간처리업 | (주)성지이테크 | 경상남도 합천군 율곡면 두사리 300번지 | 055-932-9200
174 | 175 | 중간처리업 | (주)초계산업 | 경상남도 합천군 적중면 황강옥전로 1861 . | 055-933-3999
175 | 176 | 중간처리업 | (주)상원엔텍 | 경상남도 합천군 율곡면 두사리 275번지 | <NA>