Overview

Dataset statistics

Number of variables: 4
Number of observations: 260
Missing cells: 15
Missing cells (%): 1.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.5 KiB
Average record size in memory: 33.5 B
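
The two derived figures in this table can be checked from the raw counts. The sketch below assumes the percentage is taken over all cells (observations × variables) and that the record size is total memory divided by the number of observations; the variable names are illustrative, and the memory figure is the rounded value printed in the report.

```python
# Counts copied from the Dataset statistics table.
n_variables = 4
n_observations = 260
missing_cells = 15
total_memory_kib = 8.5  # already rounded in the report

total_cells = n_variables * n_observations           # 1040 cells
missing_pct = 100 * missing_cells / total_cells      # share of empty cells
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both values round to the figures shown in the table (1.4% and 33.5 B).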

Variable types

Numeric: 1
Text: 3

Dataset

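Description: Status, as of Q2 2023, of registered business-waste generators producing 300 kg or more of waste per day, reported under Article 17 of the Wastes Control Act (폐기물관리법). Each record gives the site's location, business name, and contact number.
URL: https://www.data.go.kr/data/15060367/fileData.do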

Alerts

Missing: 연락처 (contact number) has 15 (5.8%) missing values
Unique: 연번 (serial number) has unique values
Unique: 사업장명 (business name) has unique values

Reproduction

Analysis started: 2023-12-12 06:55:50.134539
Analysis finished: 2023-12-12 06:55:50.754883
Duration: 0.62 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 260
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 130.5
Minimum: 1
Maximum: 260
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 1
5th percentile: 13.95
Q1: 65.75
Median: 130.5
Q3: 195.25
95th percentile: 247.05
Maximum: 260
Range: 259
Interquartile range (IQR): 129.5
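
These quantiles are consistent with linear interpolation over the sequence 1..260. The sketch below assumes 연번 is exactly that integer sequence (the minimum, maximum, distinct count and sum all agree with this) and uses the standard library's inclusive method as a stand-in for whatever the profiling tool does internally.

```python
import statistics

values = list(range(1, 261))  # assumed contents of 연번: 1, 2, ..., 260

# method="inclusive" treats the data as the full population and
# interpolates linearly between neighbouring order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, value_range, iqr)
```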

Descriptive statistics

Standard deviation: 75.199734
Coefficient of variation (CV): 0.57624317
Kurtosis: -1.2
Mean: 130.5
Median Absolute Deviation (MAD): 65
Skewness: 0
Sum: 33930
Variance: 5655
Monotonicity: Strictly increasing
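
Most of these figures can be reproduced with the standard library. The sketch again assumes 연번 is the integer sequence 1..260, and that the report uses the sample (n − 1) variance, which is the convention that yields 5655 here.

```python
import statistics

values = list(range(1, 261))  # assumed contents of 연번

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of the distances from the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

For a run of consecutive integers 1..n the sample variance is n(n + 1)/12, which gives 260 × 261 / 12 = 5655.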
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.4%
165 | 1 | 0.4%
167 | 1 | 0.4%
168 | 1 | 0.4%
169 | 1 | 0.4%
170 | 1 | 0.4%
171 | 1 | 0.4%
172 | 1 | 0.4%
173 | 1 | 0.4%
174 | 1 | 0.4%
Other values (250) | 250 | 96.2%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%
Maximum 10 values
Value | Count | Frequency (%)
260 | 1 | 0.4%
259 | 1 | 0.4%
258 | 1 | 0.4%
257 | 1 | 0.4%
256 | 1 | 0.4%
255 | 1 | 0.4%
254 | 1 | 0.4%
253 | 1 | 0.4%
252 | 1 | 0.4%
251 | 1 | 0.4%

사업장명 (business name)
Text

UNIQUE 

Distinct: 260
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 29
Median length: 22
Mean length: 9.4846154
Min length: 3
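
The mean length of each text column agrees with its total character count divided by its number of non-missing values. The sketch below checks this for all three text columns, using the totals printed in each column's Characters and Unicode table; that missing values are excluded from the denominator is an assumption, although it is the one that matches the 연락처 figure.

```python
# (total characters, non-missing values) per text column, from the report.
columns = {
    "사업장명": (2466, 260),
    "주소": (5808, 260),
    "연락처": (2935, 260 - 15),  # the 15 missing values are excluded
}

mean_length = {name: total / count for name, (total, count) in columns.items()}

for name, value in mean_length.items():
    print(f"{name}: {value:.6f}")
```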

Characters and Unicode

Total characters: 2466
Distinct characters: 338
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
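
As an illustration of those character properties, the sketch below tallies Unicode general categories for three of the sample business names using Python's `unicodedata` module. The mapping from two-letter category codes to the long names used in this report is written out by hand and covers only the categories that appear here; it is not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator", "So": "Other Symbol",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation", "Po": "Other Punctuation",
}

def category_counts(text):
    """Count characters in text by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

sample = ["㈜파라다이스호텔부산", "㈜조선호텔앤리조트 부산", "(주)세이브존리베라"]

totals = Counter()
for name in sample:
    totals += category_counts(name)

for category, count in totals.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter, while the enclosed symbol ㈜ is classed as Other Symbol, which is why that category appears in the tables for this column.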

Unique

Unique: 260
Unique (%): 100.0%

Sample

1st row: ㈜파라다이스호텔부산
2nd row: ㈜조선호텔앤리조트 부산
3rd row: (주)세이브존리베라
4th row: 오션타워 운영회의
5th row: 삼성생명㈜ 해운대연수소
Value | Count | Frequency (%)
주식회사 | 8 | 2.4%
관리위원회 | 5 | 1.5%
해운대점 | 3 | 0.9%
관리사무소 | 3 | 0.9%
부산 | 3 | 0.9%
관리단 | 3 | 0.9%
해운대센텀호텔 | 2 | 0.6%
운영위원회 | 2 | 0.6%
대표회의 | 2 | 0.6%
정우빌딩 | 2 | 0.6%
Other values (297) | 300 | 90.1%

Most occurring characters

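Value | Count | Frequency (%)
(space) | 74 | 3.0%
(not preserved) | 73 | 3.0%
(not preserved) | 59 | 2.4%
(not preserved) | 59 | 2.4%
(not preserved) | 58 | 2.4%
(not preserved) | 57 | 2.3%
(not preserved) | 53 | 2.1%
(not preserved) | 53 | 2.1%
(not preserved) | 52 | 2.1%
(not preserved) | 50 | 2.0%
Other values (328) | 1878 | 76.2%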

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2243 | 91.0%
Space Separator | 74 | 3.0%
Other Symbol | 50 | 2.0%
Close Punctuation | 24 | 1.0%
Open Punctuation | 24 | 1.0%
Uppercase Letter | 19 | 0.8%
Decimal Number | 16 | 0.6%
Lowercase Letter | 13 | 0.5%
Other Punctuation | 2 | 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 73 | 3.3%
(not preserved) | 59 | 2.6%
(not preserved) | 59 | 2.6%
(not preserved) | 58 | 2.6%
(not preserved) | 57 | 2.5%
(not preserved) | 53 | 2.4%
(not preserved) | 53 | 2.4%
(not preserved) | 52 | 2.3%
(not preserved) | 50 | 2.2%
(not preserved) | 46 | 2.1%
Other values (294) | 1683 | 75.0%

Uppercase Letter
Value | Count | Frequency (%)
H | 2 | 10.5%
S | 2 | 10.5%
N | 2 | 10.5%
A | 2 | 10.5%
C | 2 | 10.5%
O | 1 | 5.3%
T | 1 | 5.3%
K | 1 | 5.3%
E | 1 | 5.3%
L | 1 | 5.3%
Other values (4) | 4 | 21.1%

Lowercase Letter
Value | Count | Frequency (%)
s | 3 | 23.1%
i | 3 | 23.1%
e | 3 | 23.1%
b | 1 | 7.7%
k | 1 | 7.7%
h | 1 | 7.7%
t | 1 | 7.7%

Decimal Number
Value | Count | Frequency (%)
1 | 5 | 31.2%
2 | 5 | 31.2%
7 | 2 | 12.5%
3 | 2 | 12.5%
0 | 1 | 6.2%
6 | 1 | 6.2%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 50.0%
/ | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 74 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 50 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 24 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 24 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2293 | 93.0%
Common | 141 | 5.7%
Latin | 32 | 1.3%

Most frequent character per script

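Hangul
Value | Count | Frequency (%)
(not preserved) | 73 | 3.2%
(not preserved) | 59 | 2.6%
(not preserved) | 59 | 2.6%
(not preserved) | 58 | 2.5%
(not preserved) | 57 | 2.5%
(not preserved) | 53 | 2.3%
(not preserved) | 53 | 2.3%
(not preserved) | 52 | 2.3%
(not preserved) | 50 | 2.2%
(not preserved) | 50 | 2.2%
Other values (295) | 1729 | 75.4%

Latin
Value | Count | Frequency (%)
s | 3 | 9.4%
i | 3 | 9.4%
e | 3 | 9.4%
H | 2 | 6.2%
S | 2 | 6.2%
N | 2 | 6.2%
A | 2 | 6.2%
C | 2 | 6.2%
b | 1 | 3.1%
O | 1 | 3.1%
Other values (11) | 11 | 34.4%

Common
Value | Count | Frequency (%)
(space) | 74 | 52.5%
) | 24 | 17.0%
( | 24 | 17.0%
1 | 5 | 3.5%
2 | 5 | 3.5%
7 | 2 | 1.4%
3 | 2 | 1.4%
0 | 1 | 0.7%
, | 1 | 0.7%
6 | 1 | 0.7%
Other values (2) | 2 | 1.4%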

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2243 | 91.0%
ASCII | 173 | 7.0%
None | 50 | 2.0%

Most frequent character per block

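ASCII
Value | Count | Frequency (%)
(space) | 74 | 42.8%
) | 24 | 13.9%
( | 24 | 13.9%
1 | 5 | 2.9%
2 | 5 | 2.9%
s | 3 | 1.7%
i | 3 | 1.7%
e | 3 | 1.7%
H | 2 | 1.2%
S | 2 | 1.2%
Other values (23) | 28 | 16.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 73 | 3.3%
(not preserved) | 59 | 2.6%
(not preserved) | 59 | 2.6%
(not preserved) | 58 | 2.6%
(not preserved) | 57 | 2.5%
(not preserved) | 53 | 2.4%
(not preserved) | 53 | 2.4%
(not preserved) | 52 | 2.3%
(not preserved) | 50 | 2.2%
(not preserved) | 46 | 2.1%
Other values (294) | 1683 | 75.0%

None
Value | Count | Frequency (%)
(not preserved) | 50 | 100.0%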

주소 (address)
Text

Distinct: 254
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 38
Median length: 32
Mean length: 22.338462
Min length: 8

Characters and Unicode

Total characters: 5808
Distinct characters: 113
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 250
Unique (%): 96.2%

Sample

1st row: 부산광역시 해운대구 해운대해변로 296(중동)
2nd row: 부산광역시 해운대구 동백로 67(우동)
3rd row: 부산광역시 해운대구 구남로29번길 21(중동)
4th row: 부산광역시 해운대구 해운대해변로 203(우동)
5th row: 부산광역시 해운대구 좌동순환로468번가길 5-11(중동)
Value | Count | Frequency (%)
부산광역시 | 246 | 23.9%
해운대구 | 245 | 23.8%
해운대로 | 28 | 2.7%
해운대해변로 | 21 | 2.0%
좌동순환로 | 18 | 1.8%
센텀중앙로 | 10 | 1.0%
양운로 | 9 | 0.9%
수영강변대로 | 8 | 0.8%
센텀동로 | 8 | 0.8%
세실로 | 8 | 0.8%
Other values (319) | 427 | 41.5%

Most occurring characters

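Value | Count | Frequency (%)
(space) | 772 | 13.3%
(not preserved) | 351 | 6.0%
(not preserved) | 337 | 5.8%
(not preserved) | 333 | 5.7%
(not preserved) | 258 | 4.4%
(not preserved) | 253 | 4.4%
(not preserved) | 247 | 4.3%
(not preserved) | 247 | 4.3%
(not preserved) | 247 | 4.3%
(not preserved) | 246 | 4.2%
Other values (103) | 2517 | 43.3%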

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3822 | 65.8%
Decimal Number | 871 | 15.0%
Space Separator | 772 | 13.3%
Open Punctuation | 144 | 2.5%
Close Punctuation | 144 | 2.5%
Dash Punctuation | 21 | 0.4%
Other Punctuation | 17 | 0.3%
Uppercase Letter | 16 | 0.3%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 351 | 9.2%
(not preserved) | 337 | 8.8%
(not preserved) | 333 | 8.7%
(not preserved) | 258 | 6.8%
(not preserved) | 253 | 6.6%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 246 | 6.4%
(not preserved) | 246 | 6.4%
Other values (82) | 1057 | 27.7%

Decimal Number
Value | Count | Frequency (%)
1 | 147 | 16.9%
2 | 135 | 15.5%
6 | 89 | 10.2%
7 | 88 | 10.1%
3 | 85 | 9.8%
5 | 74 | 8.5%
4 | 71 | 8.2%
8 | 63 | 7.2%
9 | 61 | 7.0%
0 | 58 | 6.7%

Uppercase Letter
Value | Count | Frequency (%)
P | 4 | 25.0%
A | 4 | 25.0%
E | 4 | 25.0%
C | 4 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
, | 16 | 94.1%
. | 1 | 5.9%

Space Separator
Value | Count | Frequency (%)
(space) | 772 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 144 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 144 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 21 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
s | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3822 | 65.8%
Common | 1969 | 33.9%
Latin | 17 | 0.3%

Most frequent character per script

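Hangul
Value | Count | Frequency (%)
(not preserved) | 351 | 9.2%
(not preserved) | 337 | 8.8%
(not preserved) | 333 | 8.7%
(not preserved) | 258 | 6.8%
(not preserved) | 253 | 6.6%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 246 | 6.4%
(not preserved) | 246 | 6.4%
Other values (82) | 1057 | 27.7%

Common
Value | Count | Frequency (%)
(space) | 772 | 39.2%
1 | 147 | 7.5%
( | 144 | 7.3%
) | 144 | 7.3%
2 | 135 | 6.9%
6 | 89 | 4.5%
7 | 88 | 4.5%
3 | 85 | 4.3%
5 | 74 | 3.8%
4 | 71 | 3.6%
Other values (6) | 220 | 11.2%

Latin
Value | Count | Frequency (%)
P | 4 | 23.5%
A | 4 | 23.5%
E | 4 | 23.5%
C | 4 | 23.5%
s | 1 | 5.9%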

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3822 | 65.8%
ASCII | 1986 | 34.2%

Most frequent character per block

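ASCII
Value | Count | Frequency (%)
(space) | 772 | 38.9%
1 | 147 | 7.4%
( | 144 | 7.3%
) | 144 | 7.3%
2 | 135 | 6.8%
6 | 89 | 4.5%
7 | 88 | 4.4%
3 | 85 | 4.3%
5 | 74 | 3.7%
4 | 71 | 3.6%
Other values (11) | 237 | 11.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 351 | 9.2%
(not preserved) | 337 | 8.8%
(not preserved) | 333 | 8.7%
(not preserved) | 258 | 6.8%
(not preserved) | 253 | 6.6%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 247 | 6.5%
(not preserved) | 246 | 6.4%
(not preserved) | 246 | 6.4%
Other values (82) | 1057 | 27.7%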

연락처 (contact number)
Text

MISSING 

Distinct: 243
Distinct (%): 99.2%
Missing: 15
Missing (%): 5.8%
Memory size: 2.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.979592
Min length: 8

Characters and Unicode

Total characters: 2935
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 241
Unique (%): 98.4%

Sample

1st row: 051-749-2638
2nd row: 051-749-7499
3rd row: 051-740-9032
4th row: 051-740-5036
5th row: 051-740-1415
Value | Count | Frequency (%)
051-780-2599 | 2 | 0.8%
051-819-7735 | 2 | 0.8%
051-523-1225 | 1 | 0.4%
070-4166-4575 | 1 | 0.4%
051-509-8124 | 1 | 0.4%
051-730-8803 | 1 | 0.4%
051-749-2638 | 1 | 0.4%
051-747-5451 | 1 | 0.4%
051-746-2041 | 1 | 0.4%
051-747-9451 | 1 | 0.4%
Other values (233) | 233 | 95.1%

Most occurring characters

Value | Count | Frequency (%)
0 | 512 | 17.4%
- | 488 | 16.6%
1 | 419 | 14.3%
5 | 385 | 13.1%
7 | 308 | 10.5%
4 | 203 | 6.9%
2 | 154 | 5.2%
9 | 125 | 4.3%
6 | 120 | 4.1%
3 | 118 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2447 | 83.4%
Dash Punctuation | 488 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 512 | 20.9%
1 | 419 | 17.1%
5 | 385 | 15.7%
7 | 308 | 12.6%
4 | 203 | 8.3%
2 | 154 | 6.3%
9 | 125 | 5.1%
6 | 120 | 4.9%
3 | 118 | 4.8%
8 | 103 | 4.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 488 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2935 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 512 | 17.4%
- | 488 | 16.6%
1 | 419 | 14.3%
5 | 385 | 13.1%
7 | 308 | 10.5%
4 | 203 | 6.9%
2 | 154 | 5.2%
9 | 125 | 4.3%
6 | 120 | 4.1%
3 | 118 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2935 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 512 | 17.4%
- | 488 | 16.6%
1 | 419 | 14.3%
5 | 385 | 13.1%
7 | 308 | 10.5%
4 | 203 | 6.9%
2 | 154 | 5.2%
9 | 125 | 4.3%
6 | 120 | 4.1%
3 | 118 | 4.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
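
A per-column count of missing entries is the quantity these nullity views summarise. The sketch below tallies it for three of the last rows in the sample (with `None` standing in for the `<NA>` marker) and then recomputes the 연락처 percentage from the counts reported earlier; the helper name `missing_by_column` is hypothetical and not part of the profiling tool.

```python
# Three of the last sample rows; None represents the <NA> shown in the report.
rows = [
    {"사업장명": "㈜성철사", "주소": "반여로41번길 36", "연락처": "051-523-1225"},
    {"사업장명": "씨제이프레시웨이주식회사", "주소": "선수촌로 230", "연락처": None},
    {"사업장명": "에스텍시스템", "주소": "달맞이길 30, 포디움동 2층", "연락처": "051-747-9931"},
]

def missing_by_column(records):
    """Count None entries per column across a list of row dictionaries."""
    return {col: sum(rec[col] is None for rec in records) for col in records[0]}

counts = missing_by_column(rows)
print(counts)

# Share of missing 연락처 values in the full dataset, from the reported counts.
contact_missing_pct = 100 * 15 / 260
print(f"{contact_missing_pct:.1f}%")
```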

Sample

First rows
(index) | 연번 | 사업장명 | 주소 | 연락처
0 | 1 | ㈜파라다이스호텔부산 | 부산광역시 해운대구 해운대해변로 296(중동) | 051-749-2638
1 | 2 | ㈜조선호텔앤리조트 부산 | 부산광역시 해운대구 동백로 67(우동) | 051-749-7499
2 | 3 | (주)세이브존리베라 | 부산광역시 해운대구 구남로29번길 21(중동) | 051-740-9032
3 | 4 | 오션타워 운영회의 | 부산광역시 해운대구 해운대해변로 203(우동) | 051-740-5036
4 | 5 | 삼성생명㈜ 해운대연수소 | 부산광역시 해운대구 좌동순환로468번가길 5-11(중동) | 051-740-1415
5 | 6 | ㈜서원유통 탑마트 반여점 | 부산광역시 해운대구 선수촌로 119(반여동) | 051-525-0422
6 | 7 | ㈜이마트 | 부산광역시 해운대구 좌동순환로 511(중동 1767) | 051-608-1054
7 | 8 | 홈플러스(주)해운대점 | 부산광역시 해운대구 해운대해변로 140(우동) | 051-747-8514
8 | 9 | 송정호텔㈜파인프라자 | 부산광역시 해운대구 송정해변로 28(송정동) | 051-702-7766
9 | 10 | 주식회사 우일서브 | 부산광역시 해운대구 해운대해변로 257 | 051-740-5498
Last rows
(index) | 연번 | 사업장명 | 주소 | 연락처
250 | 251 | 해운대센텀호텔 관리단 | 센텀3로 20(우동) | 720-9077
251 | 252 | ㈜대일인터내셔널하스피탈리티그룹해운대지점 | 구남로 9(우동) | 051-610-3070
252 | 253 | 태광산업㈜반여공장 | 선수촌로 202-13(반여동) | 051-522-4031
253 | 254 | 바른길병원 | 선수촌로 144 | 051-525-0058
254 | 255 | 주식회사 티아처 | 해운대로76번길 16 | 070-4166-4575
255 | 256 | ㈜성철사 | 반여로41번길 36 | 051-523-1225
256 | 257 | 씨제이프레시웨이주식회사 | 선수촌로 230 | <NA>
257 | 258 | 에이치스위트해운대관리위원회 | 해운대로 601 | 051-747-9696
258 | 259 | 에스텍시스템 | 달맞이길 30, 포디움동 2층 | 051-747-9931
259 | 260 | ㈜케이알서비스(동부센트레빌 플래비뉴) | 해운대로108번길 22 | 051-784-9211