Overview

Dataset statistics

Number of variables: 5
Number of observations: 40
Missing cells: 1
Missing cells (%): 0.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 44.3 B
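
The percentages above follow from plain counts: missing cells are divided by the total number of cells (variables × observations), so 1 / (5 × 40) = 0.5%. The sketch below shows that arithmetic in plain Python, assuming the table is held as a list of row tuples with `None` marking a missing cell. The helper name `dataset_stats` and the toy rows are illustrative and not part of the report.

```python
from collections import Counter

def dataset_stats(rows):
    """Return overview counts for a table given as a list of equal-length tuples."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    n_cells = n_obs * n_vars
    missing = sum(cell is None for row in rows for cell in row)
    # A row counts as a duplicate each time it repeats an earlier row.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }

# Toy table with the report's shape: 40 rows x 5 columns, one missing cell.
rows = [(i, f"r{i}", f"name{i}", f"tel{i}", f"addr{i}") for i in range(1, 41)]
rows[5] = (6, "r6", "name6", None, "addr6")
stats = dataset_stats(rows)
print(stats)
```

Counting duplicates as "repeats of an earlier row" is a design choice made here so that a table with no repeated rows reports zero.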

Variable types

Numeric: 1
Text: 4

Dataset

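Description: Data on the status of business-site waste discharger reports (사업장폐기물배출자신고현황) in Taebaek-si, Gangwon-do. It provides the business name, address, telephone number, and related details of the business sites that discharge waste.
Author: Taebaek-si, Gangwon Special Self-Governing Province (강원특별자치도 태백시)
URL: https://www.data.go.kr/data/15060104/fileData.do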

Alerts

전화번호 (phone number) has 1 (2.5%) missing value [Missing]
연번 (serial number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 10:27:41.210129
Analysis finished: 2023-12-12 10:27:41.785045
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 40
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.5
Minimum: 1
Maximum: 40
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 492.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.95
Q1: 10.75
Median: 20.5
Q3: 30.25
95-th percentile: 38.05
Maximum: 40
Range: 39
Interquartile range (IQR): 19.5
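
These quantiles are consistent with linear interpolation over the sorted values 1 to 40 (the report gives a minimum of 1, a maximum of 40, and 40 distinct values). The check below uses the standard library's `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between data points. Treating the column as the integers 1..40 is an inference from the statistics above, not something the report states directly.

```python
import statistics

values = list(range(1, 41))  # assumed contents of the column (inferred, see above)

# Quartiles: three cut points that split the data into four groups.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# The 5th and 95th percentiles are the first and last of the 19 cut points for n=20.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(values) - min(values)
iqr = q3 - q1

print(p5, q1, median, q3, p95)  # 2.95 10.75 20.5 30.25 38.05
print(value_range, iqr)         # 39 19.5
```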

Descriptive statistics

Standard deviation: 11.690452
Coefficient of variation (CV): 0.57026595
Kurtosis: -1.2
Mean: 20.5
Median Absolute Deviation (MAD): 10
Skewness: 0
Sum: 820
Variance: 136.66667
Monotonicity: Strictly increasing
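
Most of these figures can be reproduced by hand. The variance of 136.66667 matches the sample variance (n − 1 denominator) of the integers 1 to 40, and the coefficient of variation is the standard deviation divided by the mean. The sketch below recomputes them with the standard library, again assuming the column holds exactly the integers 1..40. Kurtosis and skewness are left out because their exact estimator is not stated in the report.

```python
import statistics

values = list(range(1, 41))  # assumed contents of the column (inferred from the report)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation

median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)

# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 5), round(std_dev, 6), round(cv, 8), mad)
print(strictly_increasing)
```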
Histogram with fixed size bins (bins=40)
Value | Count | Frequency (%)
1 | 1 | 2.5%
22 | 1 | 2.5%
24 | 1 | 2.5%
25 | 1 | 2.5%
26 | 1 | 2.5%
27 | 1 | 2.5%
28 | 1 | 2.5%
29 | 1 | 2.5%
30 | 1 | 2.5%
31 | 1 | 2.5%
Other values (30) | 30 | 75.0%

Value | Count | Frequency (%)
1 | 1 | 2.5%
2 | 1 | 2.5%
3 | 1 | 2.5%
4 | 1 | 2.5%
5 | 1 | 2.5%
6 | 1 | 2.5%
7 | 1 | 2.5%
8 | 1 | 2.5%
9 | 1 | 2.5%
10 | 1 | 2.5%

Value | Count | Frequency (%)
40 | 1 | 2.5%
39 | 1 | 2.5%
38 | 1 | 2.5%
37 | 1 | 2.5%
36 | 1 | 2.5%
35 | 1 | 2.5%
34 | 1 | 2.5%
33 | 1 | 2.5%
32 | 1 | 2.5%
31 | 1 | 2.5%

신고번호 (report number)
Text

Distinct: 39
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B

Length

Max length: 10
Median length: 10
Mean length: 9.925
Min length: 7
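
A minimum length of 7 against a median of 10 indicates that at least one value does not follow the ten-character `YYYY-MM-DD` shape seen in the other rows. The sample rows at the end of the report include one such value, `2007-18`. The sketch below flags such values using only the standard library. The function name `find_malformed_dates` is illustrative, and treating the column as dates is an assumption based on how the values look.

```python
from datetime import date

def find_malformed_dates(values):
    """Return the values that are not valid ISO dates in YYYY-MM-DD form."""
    malformed = []
    for value in values:
        try:
            # Require exactly 10 characters, because newer Python versions
            # also accept other ISO 8601 forms such as "20221027".
            if len(value) != 10:
                raise ValueError(value)
            date.fromisoformat(value)
        except ValueError:
            malformed.append(value)
    return malformed

# Values taken from the sample rows of this report.
sample = ["2022-10-27", "2008-04-01", "2007-18", "2007-04-01"]
print(find_malformed_dates(sample))  # ['2007-18']
```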

Characters and Unicode

Total characters: 397
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
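
As a small illustration of that idea, Python's standard `unicodedata` module exposes the general category of each code point. The two-letter codes map to the names used in this report: `Nd` is Decimal Number, `Pd` is Dash Punctuation, `Lo` is Other Letter, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation, and `Pc` is Connector Punctuation. The helper name `category_counts` is illustrative. Script and block lookups are not covered here because the standard library does not provide them.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two date-like values and one business name from the report's sample rows.
date_counts = category_counts(["2022-10-27", "2022-01-01"])
name_counts = category_counts(["(주)강원환경"])

print(date_counts)  # 16 decimal digits (Nd) and 4 dashes (Pd)
print(name_counts)  # 5 Hangul syllables (Lo), one "(" (Ps), one ")" (Pe)
```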

Unique

Unique: 38
Unique (%): 95.0%

Sample

1st row: 2022-10-27
2nd row: 2022-01-01
3rd row: 2021-03-01
4th row: 2021-02-01
5th row: 2021-01-01

Value | Count | Frequency (%)
1998-01-01 | 2 | 5.0%
2011-05-01 | 1 | 2.5%
2011-03-01 | 1 | 2.5%
2011-02-01 | 1 | 2.5%
2010-03-01 | 1 | 2.5%
2010-01-01 | 1 | 2.5%
2009-03-01 | 1 | 2.5%
2009-02-01 | 1 | 2.5%
2009-01-01 | 1 | 2.5%
2011-04-01 | 1 | 2.5%
Other values (29) | 29 | 72.5%

Most occurring characters

Value | Count | Frequency (%)
0 | 129 | 32.5%
1 | 82 | 20.7%
- | 79 | 19.9%
2 | 62 | 15.6%
9 | 14 | 3.5%
8 | 9 | 2.3%
3 | 8 | 2.0%
4 | 4 | 1.0%
7 | 4 | 1.0%
6 | 3 | 0.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 318 | 80.1%
Dash Punctuation | 79 | 19.9%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 129 | 40.6%
1 | 82 | 25.8%
2 | 62 | 19.5%
9 | 14 | 4.4%
8 | 9 | 2.8%
3 | 8 | 2.5%
4 | 4 | 1.3%
7 | 4 | 1.3%
6 | 3 | 0.9%
5 | 3 | 0.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 79 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 397 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 129 | 32.5%
1 | 82 | 20.7%
- | 79 | 19.9%
2 | 62 | 15.6%
9 | 14 | 3.5%
8 | 9 | 2.3%
3 | 8 | 2.0%
4 | 4 | 1.0%
7 | 4 | 1.0%
6 | 3 | 0.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 397 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 129 | 32.5%
1 | 82 | 20.7%
- | 79 | 19.9%
2 | 62 | 15.6%
9 | 14 | 3.5%
8 | 9 | 2.3%
3 | 8 | 2.0%
4 | 4 | 1.0%
7 | 4 | 1.0%
6 | 3 | 0.8%

상호 (business name)
Text

Distinct: 36
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B

Length

Max length: 17
Median length: 13
Mean length: 9.6
Min length: 4

Characters and Unicode

Total characters: 384
Distinct characters: 102
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 85.0%

Sample

1st row: 주식회사 태백산김치
2nd row: 한국광해광업공단 강원지사
3rd row: (주)한얼싸이언스
4th row: 코레일테크(주)
5th row: 농업회사법인 주식회사 태백

Value | Count | Frequency (%)
강원지사 | 5 | 8.6%
한국광해관리공단 | 4 | 6.9%
주식회사 | 4 | 6.9%
근로복지공단 | 2 | 3.4%
태백병원 | 2 | 3.4%
오투리조트 | 1 | 1.7%
장성광업소 | 1 | 1.7%
주)고려노벨화약 | 1 | 1.7%
태백산김치 | 1 | 1.7%
주)태백레미콘 | 1 | 1.7%
Other values (36) | 36 | 62.1%

Most occurring characters

Value | Count | Frequency (%)
 | 20 | 5.2%
 | 20 | 5.2%
 | 18 | 4.7%
 | 18 | 4.7%
) | 16 | 4.2%
( | 16 | 4.2%
 | 16 | 4.2%
 | 13 | 3.4%
 | 11 | 2.9%
 | 11 | 2.9%
Other values (92) | 225 | 58.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 334 | 87.0%
Space Separator | 18 | 4.7%
Close Punctuation | 16 | 4.2%
Open Punctuation | 16 | 4.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 20 | 6.0%
 | 20 | 6.0%
 | 18 | 5.4%
 | 16 | 4.8%
 | 13 | 3.9%
 | 11 | 3.3%
 | 11 | 3.3%
 | 10 | 3.0%
 | 9 | 2.7%
 | 9 | 2.7%
Other values (89) | 197 | 59.0%

Space Separator
Value | Count | Frequency (%)
(space) | 18 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 16 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 16 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 334 | 87.0%
Common | 50 | 13.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 20 | 6.0%
 | 20 | 6.0%
 | 18 | 5.4%
 | 16 | 4.8%
 | 13 | 3.9%
 | 11 | 3.3%
 | 11 | 3.3%
 | 10 | 3.0%
 | 9 | 2.7%
 | 9 | 2.7%
Other values (89) | 197 | 59.0%

Common
Value | Count | Frequency (%)
(space) | 18 | 36.0%
) | 16 | 32.0%
( | 16 | 32.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 334 | 87.0%
ASCII | 50 | 13.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 20 | 6.0%
 | 20 | 6.0%
 | 18 | 5.4%
 | 16 | 4.8%
 | 13 | 3.9%
 | 11 | 3.3%
 | 11 | 3.3%
 | 10 | 3.0%
 | 9 | 2.7%
 | 9 | 2.7%
Other values (89) | 197 | 59.0%

ASCII
Value | Count | Frequency (%)
(space) | 18 | 36.0%
) | 16 | 32.0%
( | 16 | 32.0%

전화번호 (phone number)
Text

MISSING

Distinct: 34
Distinct (%): 87.2%
Missing: 1
Missing (%): 2.5%
Memory size: 452.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 468
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 76.9%
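
The distinct and unique percentages for this variable are taken over the 39 non-missing values rather than all 40 rows (34 / 39 ≈ 87.2% and 30 / 39 ≈ 76.9%), while the missing percentage uses all rows (1 / 40 = 2.5%). The sketch below reproduces that arithmetic on a synthetic column with the same multiplicities as the value counts in this section: one value occurring three times, three values occurring twice, thirty occurring once, and one missing entry. The helper name `column_summary` and the placeholder values are illustrative.

```python
from collections import Counter

def column_summary(values):
    """Summarise a column given as a list in which None marks a missing value."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    distinct = len(counts)                              # different values seen
    unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
    n_present = len(present)
    n_missing = len(values) - n_present
    return {
        "distinct": distinct,
        "distinct_pct": 100 * distinct / n_present if n_present else 0.0,
        "unique": unique,
        "unique_pct": 100 * unique / n_present if n_present else 0.0,
        "missing": n_missing,
        "missing_pct": 100 * n_missing / len(values) if values else 0.0,
    }

# Synthetic stand-in for the column: placeholders, not real phone numbers.
column = ["a"] * 3 + ["b", "c", "d"] * 2 + [f"u{i}" for i in range(30)] + [None]
summary = column_summary(column)
print({k: round(v, 1) for k, v in summary.items()})
```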

Sample

1st row: 033-582-9838
2nd row: 033-550-9136
3rd row: 033-930-7700
4th row: 042-220-3305
5th row: 033-582-9881

Value | Count | Frequency (%)
033-550-9134 | 3 | 7.7%
033-553-2897 | 2 | 5.1%
033-580-3252 | 2 | 5.1%
033-553-2755 | 2 | 5.1%
033-582-9838 | 1 | 2.6%
033-580-1234 | 1 | 2.6%
033-552-9005 | 1 | 2.6%
033-581-0190 | 1 | 2.6%
033-580-7215 | 1 | 2.6%
033-553-9622 | 1 | 2.6%
Other values (24) | 24 | 61.5%

Most occurring characters

Value | Count | Frequency (%)
3 | 100 | 21.4%
5 | 82 | 17.5%
- | 78 | 16.7%
0 | 71 | 15.2%
8 | 28 | 6.0%
2 | 28 | 6.0%
1 | 26 | 5.6%
9 | 20 | 4.3%
4 | 14 | 3.0%
7 | 14 | 3.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 390 | 83.3%
Dash Punctuation | 78 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 100 | 25.6%
5 | 82 | 21.0%
0 | 71 | 18.2%
8 | 28 | 7.2%
2 | 28 | 7.2%
1 | 26 | 6.7%
9 | 20 | 5.1%
4 | 14 | 3.6%
7 | 14 | 3.6%
6 | 7 | 1.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 78 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 468 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 100 | 21.4%
5 | 82 | 17.5%
- | 78 | 16.7%
0 | 71 | 15.2%
8 | 28 | 6.0%
2 | 28 | 6.0%
1 | 26 | 5.6%
9 | 20 | 4.3%
4 | 14 | 3.0%
7 | 14 | 3.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 468 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 100 | 21.4%
5 | 82 | 17.5%
- | 78 | 16.7%
0 | 71 | 15.2%
8 | 28 | 6.0%
2 | 28 | 6.0%
1 | 26 | 5.6%
9 | 20 | 4.3%
4 | 14 | 3.0%
7 | 14 | 3.0%

사업장도로명주소 (business site road-name address)
Text

Distinct: 39
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B

Length

Max length: 42
Median length: 37
Mean length: 28.6
Min length: 20

Characters and Unicode

Total characters: 1144
Distinct characters: 101
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 38
Unique (%): 95.0%

Sample

1st row: 강원특별자치도 태백시 태백로 2306-66 (동점동)
2nd row: 강원특별자치도 태백시 황지로 68_ 광해방지사업단 강원지사 (황지동)
3rd row: 강원특별자치도 태백시 철암공단길 16-15 (철암동)
4th row: 강원특별자치도 태백시 통동 152-6
5th row: 강원특별자치도 태백시 태백로 2306-70 (동점동)

Value | Count | Frequency (%)
강원특별자치도 | 40 | 18.9%
태백시 | 40 | 18.9%
화전동 | 6 | 2.8%
태백로 | 6 | 2.8%
장성동 | 6 | 2.8%
철암동 | 5 | 2.4%
황지동 | 5 | 2.4%
백산동 | 5 | 2.4%
통동 | 4 | 1.9%
동점동 | 4 | 1.9%
Other values (70) | 91 | 42.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 181 | 15.8%
 | 56 | 4.9%
 | 52 | 4.5%
 | 50 | 4.4%
 | 44 | 3.8%
 | 44 | 3.8%
 | 43 | 3.8%
 | 43 | 3.8%
 | 41 | 3.6%
 | 40 | 3.5%
Other values (91) | 550 | 48.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 744 | 65.0%
Space Separator | 181 | 15.8%
Decimal Number | 127 | 11.1%
Close Punctuation | 35 | 3.1%
Open Punctuation | 35 | 3.1%
Dash Punctuation | 15 | 1.3%
Connector Punctuation | 7 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 56 | 7.5%
 | 52 | 7.0%
 | 50 | 6.7%
 | 44 | 5.9%
 | 44 | 5.9%
 | 43 | 5.8%
 | 43 | 5.8%
 | 41 | 5.5%
 | 40 | 5.4%
 | 40 | 5.4%
Other values (76) | 291 | 39.1%

Decimal Number
Value | Count | Frequency (%)
8 | 20 | 15.7%
1 | 16 | 12.6%
2 | 16 | 12.6%
6 | 15 | 11.8%
3 | 12 | 9.4%
9 | 11 | 8.7%
5 | 11 | 8.7%
0 | 10 | 7.9%
7 | 10 | 7.9%
4 | 6 | 4.7%

Space Separator
Value | Count | Frequency (%)
(space) | 181 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 35 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 35 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 15 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 7 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 744 | 65.0%
Common | 400 | 35.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 56 | 7.5%
 | 52 | 7.0%
 | 50 | 6.7%
 | 44 | 5.9%
 | 44 | 5.9%
 | 43 | 5.8%
 | 43 | 5.8%
 | 41 | 5.5%
 | 40 | 5.4%
 | 40 | 5.4%
Other values (76) | 291 | 39.1%

Common
Value | Count | Frequency (%)
(space) | 181 | 45.2%
) | 35 | 8.8%
( | 35 | 8.8%
8 | 20 | 5.0%
1 | 16 | 4.0%
2 | 16 | 4.0%
6 | 15 | 3.8%
- | 15 | 3.8%
3 | 12 | 3.0%
9 | 11 | 2.8%
Other values (5) | 44 | 11.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 744 | 65.0%
ASCII | 400 | 35.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 181 | 45.2%
) | 35 | 8.8%
( | 35 | 8.8%
8 | 20 | 5.0%
1 | 16 | 4.0%
2 | 16 | 4.0%
6 | 15 | 3.8%
- | 15 | 3.8%
3 | 12 | 3.0%
9 | 11 | 2.8%
Other values (5) | 44 | 11.0%

Hangul
Value | Count | Frequency (%)
 | 56 | 7.5%
 | 52 | 7.0%
 | 50 | 6.7%
 | 44 | 5.9%
 | 44 | 5.9%
 | 43 | 5.8%
 | 43 | 5.8%
 | 41 | 5.5%
 | 40 | 5.4%
 | 40 | 5.4%
Other values (76) | 291 | 39.1%

Interactions


Correlations

 | 연번 | 신고번호 | 상호 | 전화번호 | 사업장도로명주소
연번 | 1.000 | 0.932 | 0.929 | 0.967 | 1.000
신고번호 | 0.932 | 1.000 | 0.989 | 0.987 | 0.994
상호 | 0.929 | 0.989 | 1.000 | 0.995 | 0.979
전화번호 | 0.967 | 0.987 | 0.995 | 1.000 | 1.000
사업장도로명주소 | 1.000 | 0.994 | 0.979 | 1.000 | 1.000
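
Every coefficient in this matrix is close to 1. That would be consistent with a categorical association measure applied to columns whose values are almost all distinct, since each value in one column then pairs with essentially one value in the other. The report does not state which measure produced the matrix, so the use of uncorrected Cramér's V below is an assumption made purely for illustration. The function name `cramers_v` and the toy columns are hypothetical.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    chi2 = 0.0
    for x, x_count in x_totals.items():
        for y, y_count in y_totals.items():
            expected = x_count * y_count / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

# All-distinct columns pair up one-to-one, so V comes out at (about) 1.
names = ["a", "b", "c", "d", "e"]
phones = ["p1", "p2", "p3", "p4", "p5"]
v_distinct = cramers_v(names, phones)

# Two balanced, unrelated columns give V = 0.
v_unrelated = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])

print(round(v_distinct, 3), round(v_unrelated, 3))
```

Under this assumption, values near 1 between nearly unique text columns say little about a real relationship and mostly reflect the high cardinality of the columns.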

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 신고번호 | 상호 | 전화번호 | 사업장도로명주소
0 | 1 | 2022-10-27 | 주식회사 태백산김치 | 033-582-9838 | 강원특별자치도 태백시 태백로 2306-66 (동점동)
1 | 2 | 2022-01-01 | 한국광해광업공단 강원지사 | 033-550-9136 | 강원특별자치도 태백시 황지로 68_ 광해방지사업단 강원지사 (황지동)
2 | 3 | 2021-03-01 | (주)한얼싸이언스 | 033-930-7700 | 강원특별자치도 태백시 철암공단길 16-15 (철암동)
3 | 4 | 2021-02-01 | 코레일테크(주) | 042-220-3305 | 강원특별자치도 태백시 통동 152-6
4 | 5 | 2021-01-01 | 농업회사법인 주식회사 태백 | 033-582-9881 | 강원특별자치도 태백시 태백로 2306-70 (동점동)
5 | 6 | 2020-04-01 | 한국광해관리공단 강원지사 | 033-550-9134 | 강원특별자치도 태백시 장성동 29-7 연화 수실정화시설
6 | 7 | 2020-03-01 | 한국광해관리공단 강원지사 | 033-550-9134 | 강원특별자치도 태백시 소도동 산 78 동해7갱 자연정화시설
7 | 8 | 2020-02-01 | 한국광해관리공단 강원지사 | 033-550-9134 | 강원특별자치도 태백시 소도동 산 78 동해6갱 자연정화시설
8 | 9 | 2020-01-01 | (합)서진골재 | 033-552-3005 | 강원특별자치도 태백시 태백로 2149 (장성동)
9 | 10 | 2019-02-01 | 태백농업협동조합 | 033-550-4780 | 강원특별자치도 태백시 세곡길 7 (화전동)

(index) | 연번 | 신고번호 | 상호 | 전화번호 | 사업장도로명주소
30 | 31 | 1998-01-01 | 근로복지공단 태백병원 | 033-580-3252 | 강원특별자치도 태백시 보드미길 8 (장성동)
31 | 32 | 2008-04-01 | (주)이마트 태백점 | 033-580-1234 | 강원특별자치도 태백시 굴거랑길 4 (화전동)
32 | 33 | 2007-18 | 주식회사 대우레미콘 | 033-554-4885 | 강원특별자치도 태백시 솔안길 85 (통동)
33 | 34 | 2007-04-01 | (주)새성도레미콘 | 033-581-6455 | 강원특별자치도 태백시 태백로 2034 (장성동)
34 | 35 | 2007-02-01 | (주)삼양기업 | 033-553-2897 | 강원특별자치도 태백시 동태백로 880 (백산동)
35 | 36 | 2005-01-01 | 한국광해관리공단 강원지사 | 033-550-9132 | 강원특별자치도 태백시 소도길 9-12_ 구함태탄광폐수처리시설관리동 (소도동)
36 | 37 | 2001-02-01 | (주)강원환경 | 033-554-4500 | 강원특별자치도 태백시 된각길 6-20 (적각동)
37 | 38 | 1998-02-01 | 한국수자원공사 태백권지사 | 033-550-1236 | 강원특별자치도 태백시 구와우길 66 (황지동_ 한국수자원공사)
38 | 39 | 1999-03-01 | 대한석탄공사 장성광업소 | 033-581-7181 | 강원특별자치도 태백시 태백로 1889 (장성동_대한석탄공사 장성광업소)
39 | 40 | 1998-01-01 | 태백상하수도사업소 | 033-581-8589 | 강원특별자치도 태백시 사군드리길 103-38 (동점동)