Overview

Dataset statistics

Number of variables8
Number of observations177
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.4 KiB
Average record size in memory65.7 B

Variable types

Numeric1
Categorical5
Text2

Dataset

Description청주시 가로쓰레기통 설치현황으로 청주시내 쓰레기통 설치 위치, 목적, 관할부서를 확인할 수 있습니다. 해당 데이터는 1년주기로 갱신하고 있으며 기타 궁금하신 사항이 있으면 관할 구청 환경위생과에서 답변이 가능합니다.
Author충청북도 청주시
URLhttps://www.data.go.kr/data/15087394/fileData.do

Alerts

시군구 has constant value ""Constant
연번 is highly overall correlated with 관리기관 and 1 other fieldsHigh correlation
관리기관 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
도로(가로)명 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
설치지점 is highly overall correlated with 도로(가로)명High correlation
설치지점 is highly imbalanced (92.5%)Imbalance
수거쓰레기 종류 is highly imbalanced (91.1%)Imbalance
연번 has unique valuesUnique
정류장명(번호) has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:15:05.132163
Analysis finished2023-12-12 18:15:06.102835
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct177
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89
Minimum1
Maximum177
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T03:15:06.207709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.8
Q145
median89
Q3133
95-th percentile168.2
Maximum177
Range176
Interquartile range (IQR)88

Descriptive statistics

Standard deviation51.239633
Coefficient of variation (CV)0.57572621
Kurtosis-1.2
Mean89
Median Absolute Deviation (MAD)44
Skewness0
Sum15753
Variance2625.5
MonotonicityStrictly increasing
2023-12-13T03:15:06.423684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
134 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
117 1
 
0.6%
118 1
 
0.6%
119 1
 
0.6%
120 1
 
0.6%
121 1
 
0.6%
Other values (167) 167
94.4%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
177 1
0.6%
176 1
0.6%
175 1
0.6%
174 1
0.6%
173 1
0.6%
172 1
0.6%
171 1
0.6%
170 1
0.6%
169 1
0.6%
168 1
0.6%

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
충청북도 청주시
177 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청북도 청주시
2nd row충청북도 청주시
3rd row충청북도 청주시
4th row충청북도 청주시
5th row충청북도 청주시

Common Values

ValueCountFrequency (%)
충청북도 청주시 177
100.0%

Length

2023-12-13T03:15:06.596983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:15:06.711974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청북도 177
50.0%
청주시 177
50.0%

관리기관
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
서원구 환경위생과
66 
흥덕구 환경위생과
61 
청원구 환경위생과
39 
상당구 환경위생과
11 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상당구 환경위생과
2nd row상당구 환경위생과
3rd row상당구 환경위생과
4th row상당구 환경위생과
5th row상당구 환경위생과

Common Values

ValueCountFrequency (%)
서원구 환경위생과 66
37.3%
흥덕구 환경위생과 61
34.5%
청원구 환경위생과 39
22.0%
상당구 환경위생과 11
 
6.2%

Length

2023-12-13T03:15:06.859380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:15:07.025690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경위생과 177
50.0%
서원구 66
 
18.6%
흥덕구 61
 
17.2%
청원구 39
 
11.0%
상당구 11
 
3.1%

도로(가로)명
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
사직대로
19 
청남로
16 
상당로
15 
제1순환로
14 
충청대로
11 
Other values (22)
102 

Length

Max length5
Median length3
Mean length3.5423729
Min length3

Unique

Unique3 ?
Unique (%)1.7%

Sample

1st row상당로
2nd row상당로
3rd row상당로
4th row상당로
5th row상당로

Common Values

ValueCountFrequency (%)
사직대로 19
 
10.7%
청남로 16
 
9.0%
상당로 15
 
8.5%
제1순환로 14
 
7.9%
충청대로 11
 
6.2%
서부로 11
 
6.2%
가로수로 10
 
5.6%
사운로 9
 
5.1%
오송읍 9
 
5.1%
1순환로 8
 
4.5%
Other values (17) 55
31.1%

Length

2023-12-13T03:15:07.194221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사직대로 19
 
10.7%
청남로 16
 
9.0%
상당로 15
 
8.5%
제1순환로 14
 
7.9%
충청대로 11
 
6.2%
서부로 11
 
6.2%
가로수로 10
 
5.6%
사운로 9
 
5.1%
오송읍 9
 
5.1%
1순환로 8
 
4.5%
Other values (17) 55
31.1%
Distinct170
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T03:15:07.632770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length7.9887006
Min length5

Characters and Unicode

Total characters1414
Distinct characters91
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)93.2%

Sample

1st row상당로 18
2nd row상당로 34
3rd row중앙로 80
4th row상당로 184
5th row상당로 118
ValueCountFrequency (%)
사직대로 18
 
5.2%
상당로 13
 
3.7%
청남로 11
 
3.2%
1순환로 10
 
2.9%
복대동 9
 
2.6%
개신동 9
 
2.6%
충청대로 9
 
2.6%
사운로 8
 
2.3%
공항로 7
 
2.0%
가경동 5
 
1.4%
Other values (208) 250
71.6%
2023-12-13T03:15:08.179956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
172
 
12.2%
1 149
 
10.5%
111
 
7.9%
4 58
 
4.1%
2 58
 
4.1%
3 57
 
4.0%
0 55
 
3.9%
- 53
 
3.7%
53
 
3.7%
7 52
 
3.7%
Other values (81) 596
42.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 606
42.9%
Other Letter 583
41.2%
Space Separator 172
 
12.2%
Dash Punctuation 53
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
19.0%
53
 
9.1%
45
 
7.7%
28
 
4.8%
24
 
4.1%
21
 
3.6%
17
 
2.9%
13
 
2.2%
13
 
2.2%
13
 
2.2%
Other values (69) 245
42.0%
Decimal Number
ValueCountFrequency (%)
1 149
24.6%
4 58
 
9.6%
2 58
 
9.6%
3 57
 
9.4%
0 55
 
9.1%
7 52
 
8.6%
6 52
 
8.6%
5 49
 
8.1%
8 42
 
6.9%
9 34
 
5.6%
Space Separator
ValueCountFrequency (%)
172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 831
58.8%
Hangul 583
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
19.0%
53
 
9.1%
45
 
7.7%
28
 
4.8%
24
 
4.1%
21
 
3.6%
17
 
2.9%
13
 
2.2%
13
 
2.2%
13
 
2.2%
Other values (69) 245
42.0%
Common
ValueCountFrequency (%)
172
20.7%
1 149
17.9%
4 58
 
7.0%
2 58
 
7.0%
3 57
 
6.9%
0 55
 
6.6%
- 53
 
6.4%
7 52
 
6.3%
6 52
 
6.3%
5 49
 
5.9%
Other values (2) 76
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 831
58.8%
Hangul 583
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
172
20.7%
1 149
17.9%
4 58
 
7.0%
2 58
 
7.0%
3 57
 
6.9%
0 55
 
6.6%
- 53
 
6.4%
7 52
 
6.3%
6 52
 
6.3%
5 49
 
5.9%
Other values (2) 76
9.1%
Hangul
ValueCountFrequency (%)
111
19.0%
53
 
9.1%
45
 
7.7%
28
 
4.8%
24
 
4.1%
21
 
3.6%
17
 
2.9%
13
 
2.2%
13
 
2.2%
13
 
2.2%
Other values (69) 245
42.0%

설치지점
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
버스정류장
174 
택시승강장
 
1
기타
 
1
지하도입구
 
1

Length

Max length5
Median length5
Mean length4.9830508
Min length2

Unique

Unique3 ?
Unique (%)1.7%

Sample

1st row버스정류장
2nd row버스정류장
3rd row버스정류장
4th row버스정류장
5th row버스정류장

Common Values

ValueCountFrequency (%)
버스정류장 174
98.3%
택시승강장 1
 
0.6%
기타 1
 
0.6%
지하도입구 1
 
0.6%

Length

2023-12-13T03:15:08.372476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:15:08.520941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
버스정류장 174
98.3%
택시승강장 1
 
0.6%
기타 1
 
0.6%
지하도입구 1
 
0.6%
Distinct177
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T03:15:08.832032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length12.446328
Min length5

Characters and Unicode

Total characters2203
Distinct characters203
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique177 ?
Unique (%)100.0%

Sample

1st row육거리(1614)
2nd row서운동(1612)
3rd row방아다리(1507)
4th row방아다리(1508)
5th row상당공원(1504)
ValueCountFrequency (%)
시외버스터미널 3
 
1.5%
방향 3
 
1.5%
3
 
1.5%
충북대학교 2
 
1.0%
청주여객터미널 2
 
1.0%
농수산물도매시장 2
 
1.0%
육거리(1614 1
 
0.5%
북문(3223 1
 
0.5%
솔밭초등학교.신영지웰시티(2210 1
 
0.5%
시외버스터미널(2055 1
 
0.5%
Other values (185) 185
90.7%
2023-12-13T03:15:09.414811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 173
 
7.9%
) 173
 
7.9%
1 166
 
7.5%
2 86
 
3.9%
5 82
 
3.7%
0 74
 
3.4%
3 64
 
2.9%
4 56
 
2.5%
6 51
 
2.3%
39
 
1.8%
Other values (193) 1239
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1128
51.2%
Decimal Number 674
30.6%
Open Punctuation 173
 
7.9%
Close Punctuation 173
 
7.9%
Space Separator 30
 
1.4%
Other Punctuation 14
 
0.6%
Uppercase Letter 6
 
0.3%
Dash Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
3.5%
33
 
2.9%
27
 
2.4%
27
 
2.4%
26
 
2.3%
26
 
2.3%
26
 
2.3%
26
 
2.3%
25
 
2.2%
25
 
2.2%
Other values (173) 848
75.2%
Decimal Number
ValueCountFrequency (%)
1 166
24.6%
2 86
12.8%
5 82
12.2%
0 74
11.0%
3 64
 
9.5%
4 56
 
8.3%
6 51
 
7.6%
7 35
 
5.2%
8 32
 
4.7%
9 28
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 7
50.0%
. 5
35.7%
/ 2
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
B 2
33.3%
K 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 173
100.0%
Close Punctuation
ValueCountFrequency (%)
) 173
100.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1128
51.2%
Common 1069
48.5%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
3.5%
33
 
2.9%
27
 
2.4%
27
 
2.4%
26
 
2.3%
26
 
2.3%
26
 
2.3%
26
 
2.3%
25
 
2.2%
25
 
2.2%
Other values (173) 848
75.2%
Common
ValueCountFrequency (%)
( 173
16.2%
) 173
16.2%
1 166
15.5%
2 86
8.0%
5 82
7.7%
0 74
6.9%
3 64
 
6.0%
4 56
 
5.2%
6 51
 
4.8%
7 35
 
3.3%
Other values (7) 109
10.2%
Latin
ValueCountFrequency (%)
S 2
33.3%
B 2
33.3%
K 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1128
51.2%
ASCII 1075
48.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 173
16.1%
) 173
16.1%
1 166
15.4%
2 86
8.0%
5 82
7.6%
0 74
6.9%
3 64
 
6.0%
4 56
 
5.2%
6 51
 
4.7%
7 35
 
3.3%
Other values (10) 115
10.7%
Hangul
ValueCountFrequency (%)
39
 
3.5%
33
 
2.9%
27
 
2.4%
27
 
2.4%
26
 
2.3%
26
 
2.3%
26
 
2.3%
26
 
2.3%
25
 
2.2%
25
 
2.2%
Other values (173) 848
75.2%

수거쓰레기 종류
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
일반쓰레기 수거용
175 
일반쓰레기, 재활용쓰레기 수거용
 
2

Length

Max length17
Median length9
Mean length9.0903955
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반쓰레기 수거용
2nd row일반쓰레기 수거용
3rd row일반쓰레기 수거용
4th row일반쓰레기 수거용
5th row일반쓰레기 수거용

Common Values

ValueCountFrequency (%)
일반쓰레기 수거용 175
98.9%
일반쓰레기, 재활용쓰레기 수거용 2
 
1.1%

Length

2023-12-13T03:15:09.649963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:15:09.838604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반쓰레기 177
49.7%
수거용 177
49.7%
재활용쓰레기 2
 
0.6%

Interactions

2023-12-13T03:15:05.695714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:15:09.960545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관리기관도로(가로)명설치지점수거쓰레기 종류
연번1.0000.9490.9310.0660.300
관리기관0.9491.0000.9400.0000.233
도로(가로)명0.9310.9401.0000.7980.454
설치지점0.0660.0000.7981.0000.000
수거쓰레기 종류0.3000.2330.4540.0001.000
2023-12-13T03:15:10.119421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치지점도로(가로)명수거쓰레기 종류관리기관
설치지점1.0000.5210.0000.000
도로(가로)명0.5211.0000.3610.747
수거쓰레기 종류0.0000.3611.0000.153
관리기관0.0000.7470.1531.000
2023-12-13T03:15:10.272743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관리기관도로(가로)명설치지점수거쓰레기 종류
연번1.0000.8560.6580.0090.224
관리기관0.8561.0000.7470.0000.153
도로(가로)명0.6580.7471.0000.5210.361
설치지점0.0090.0000.5211.0000.000
수거쓰레기 종류0.2240.1530.3610.0001.000

Missing values

2023-12-13T03:15:05.853042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:15:06.031223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군구관리기관도로(가로)명설치세부위치설치지점정류장명(번호)수거쓰레기 종류
01충청북도 청주시상당구 환경위생과상당로상당로 18버스정류장육거리(1614)일반쓰레기 수거용
12충청북도 청주시상당구 환경위생과상당로상당로 34버스정류장서운동(1612)일반쓰레기 수거용
23충청북도 청주시상당구 환경위생과상당로중앙로 80버스정류장방아다리(1507)일반쓰레기 수거용
34충청북도 청주시상당구 환경위생과상당로상당로 184버스정류장방아다리(1508)일반쓰레기 수거용
45충청북도 청주시상당구 환경위생과상당로상당로 118버스정류장상당공원(1504)일반쓰레기 수거용
56충청북도 청주시상당구 환경위생과상당로상당로 155버스정류장시청(1505)일반쓰레기 수거용
67충청북도 청주시상당구 환경위생과상당로상당로 150-1버스정류장시청(1506)일반쓰레기 수거용
78충청북도 청주시상당구 환경위생과상당로상당로 111-1버스정류장상당공원(1503)일반쓰레기 수거용
89충청북도 청주시상당구 환경위생과상당로상당로 49버스정류장용두사지철당간(1611)일반쓰레기 수거용
910충청북도 청주시상당구 환경위생과사직대로사직대로 358버스정류장청주대교(1543)일반쓰레기 수거용
연번시군구관리기관도로(가로)명설치세부위치설치지점정류장명(번호)수거쓰레기 종류
167168충청북도 청주시청원구 환경위생과충청대로충청대로 135버스정류장신흥고등학교(1442)일반쓰레기 수거용
168169충청북도 청주시청원구 환경위생과충청대로충청대로 151버스정류장럭키아파트(1443)일반쓰레기 수거용
169170충청북도 청주시청원구 환경위생과충청대로충청대로 121버스정류장신흥고등학교(1441)일반쓰레기 수거용
170171충청북도 청주시청원구 환경위생과율봉로율봉로 141지하도입구농협 율량동지점 앞 지하도 입구일반쓰레기 수거용
171172충청북도 청주시청원구 환경위생과충청대로주중동 1061버스정류장마로니에공원(1445)일반쓰레기 수거용
172173충청북도 청주시청원구 환경위생과충청대로주성동 344버스정류장주중동(1446)일반쓰레기 수거용
173174충청북도 청주시청원구 환경위생과직지대로직지대로 879버스정류장청주여객터미널 북부정류장(하차장 앞)일반쓰레기, 재활용쓰레기 수거용
174175충청북도 청주시청원구 환경위생과직지대로직지대로 874-2버스정류장청주여객터미널 북부정류장(승차장 앞)일반쓰레기, 재활용쓰레기 수거용
175176충청북도 청주시청원구 환경위생과충청대로충청대로 99버스정류장동양일보(1439)일반쓰레기 수거용
176177충청북도 청주시청원구 환경위생과충청대로충청대로 100버스정류장동양일보(1440)일반쓰레기 수거용