Overview

Dataset statistics

Number of variables8
Number of observations4191
Missing cells75
Missing cells (%)0.2%
Duplicate rows55
Duplicate rows (%)1.3%
Total size in memory266.2 KiB
Average record size in memory65.0 B

Variable types

DateTime1
Categorical4
Text2
Numeric1

Dataset

Description전라북도 고창군 대형폐기물 수거 현황( 수거일자, 폐기물 구분, 폐기물 명, 폐기물 규경, 개수, 행정동, 관리기관, 전화번호)에 관한 데이터입니다.
URLhttps://www.data.go.kr/data/15097585/fileData.do

Alerts

행정동 has constant value ""Constant
관리기관 has constant value ""Constant
Dataset has 55 (1.3%) duplicate rowsDuplicates
폐기물 구분 is highly imbalanced (61.1%)Imbalance
개수 has 73 (1.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 17:52:16.333608
Analysis finished2023-12-12 17:52:17.673647
Duration1.34 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct612
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
Minimum2021-01-20 00:00:00
Maximum2023-08-21 00:00:00
2023-12-13T02:52:17.784364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:52:17.967425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

폐기물 구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
가구류
3265 
기타
919 
가구
 
6
가루규
 
1

Length

Max length3
Median length3
Mean length2.779289
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row기타
2nd row가구류
3rd row기타
4th row가구
5th row기타

Common Values

ValueCountFrequency (%)
가구류 3265
77.9%
기타 919
 
21.9%
가구 6
 
0.1%
가루규 1
 
< 0.1%

Length

2023-12-13T02:52:18.154082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:52:18.269572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가구류 3265
77.9%
기타 919
 
21.9%
가구 6
 
0.1%
가루규 1
 
< 0.1%
Distinct2705
Distinct (%)64.6%
Missing2
Missing (%)< 0.1%
Memory size32.9 KiB
2023-12-13T02:52:18.628365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length90
Median length67
Mean length10.142516
Min length2

Characters and Unicode

Total characters42487
Distinct characters466
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2449 ?
Unique (%)58.5%

Sample

1st row의자2+ 상3+ 옷걸이+ 돗자리 등
2nd row싱글침대1+ CD장1+ 책장2+ 책상1+ 쇼파베드1
3rd row전기장판1
4th row책상1+ 쇼파1+ 침대 프레임1
5th row침대매트리스2
ValueCountFrequency (%)
매트리스1 467
 
5.2%
의자1 318
 
3.5%
246
 
2.7%
쇼파1 211
 
2.3%
책상1 196
 
2.2%
소파1 181
 
2.0%
서랍장1 174
 
1.9%
의자2 142
 
1.6%
장롱1 139
 
1.5%
단스1 130
 
1.4%
Other values (2019) 6795
75.5%
2023-12-13T02:52:19.538599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5193
 
12.2%
4878
 
11.5%
+ 4068
 
9.6%
1763
 
4.1%
2 1210
 
2.8%
1209
 
2.8%
1148
 
2.7%
1126
 
2.7%
1059
 
2.5%
989
 
2.3%
Other values (456) 19844
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25396
59.8%
Decimal Number 7427
 
17.5%
Space Separator 4878
 
11.5%
Math Symbol 4089
 
9.6%
Close Punctuation 238
 
0.6%
Open Punctuation 238
 
0.6%
Uppercase Letter 156
 
0.4%
Lowercase Letter 38
 
0.1%
Other Punctuation 11
 
< 0.1%
Modifier Symbol 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1763
 
6.9%
1209
 
4.8%
1148
 
4.5%
1126
 
4.4%
1059
 
4.2%
989
 
3.9%
839
 
3.3%
737
 
2.9%
732
 
2.9%
626
 
2.5%
Other values (418) 15168
59.7%
Decimal Number
ValueCountFrequency (%)
1 5193
69.9%
2 1210
 
16.3%
3 496
 
6.7%
4 232
 
3.1%
5 129
 
1.7%
6 64
 
0.9%
0 52
 
0.7%
7 32
 
0.4%
8 15
 
0.2%
9 4
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
V 70
44.9%
T 70
44.9%
L 6
 
3.8%
D 2
 
1.3%
C 2
 
1.3%
S 2
 
1.3%
Q 1
 
0.6%
P 1
 
0.6%
R 1
 
0.6%
F 1
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
v 15
39.5%
t 14
36.8%
k 3
 
7.9%
g 3
 
7.9%
c 1
 
2.6%
r 1
 
2.6%
s 1
 
2.6%
Other Punctuation
ValueCountFrequency (%)
. 7
63.6%
: 2
 
18.2%
& 1
 
9.1%
/ 1
 
9.1%
Math Symbol
ValueCountFrequency (%)
+ 4068
99.5%
~ 21
 
0.5%
Space Separator
ValueCountFrequency (%)
4878
100.0%
Close Punctuation
ValueCountFrequency (%)
) 238
100.0%
Open Punctuation
ValueCountFrequency (%)
( 238
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25396
59.8%
Common 16897
39.8%
Latin 194
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1763
 
6.9%
1209
 
4.8%
1148
 
4.5%
1126
 
4.4%
1059
 
4.2%
989
 
3.9%
839
 
3.3%
737
 
2.9%
732
 
2.9%
626
 
2.5%
Other values (418) 15168
59.7%
Common
ValueCountFrequency (%)
1 5193
30.7%
4878
28.9%
+ 4068
24.1%
2 1210
 
7.2%
3 496
 
2.9%
) 238
 
1.4%
( 238
 
1.4%
4 232
 
1.4%
5 129
 
0.8%
6 64
 
0.4%
Other values (11) 151
 
0.9%
Latin
ValueCountFrequency (%)
V 70
36.1%
T 70
36.1%
v 15
 
7.7%
t 14
 
7.2%
L 6
 
3.1%
k 3
 
1.5%
g 3
 
1.5%
D 2
 
1.0%
C 2
 
1.0%
S 2
 
1.0%
Other values (7) 7
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25396
59.8%
ASCII 17091
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5193
30.4%
4878
28.5%
+ 4068
23.8%
2 1210
 
7.1%
3 496
 
2.9%
) 238
 
1.4%
( 238
 
1.4%
4 232
 
1.4%
5 129
 
0.8%
V 70
 
0.4%
Other values (28) 339
 
2.0%
Hangul
ValueCountFrequency (%)
1763
 
6.9%
1209
 
4.8%
1148
 
4.5%
1126
 
4.4%
1059
 
4.2%
989
 
3.9%
839
 
3.3%
737
 
2.9%
732
 
2.9%
626
 
2.5%
Other values (418) 15168
59.7%

폐기물 규격
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
대형
2500 
중형
902 
<NA>
429 
소형
360 

Length

Max length4
Median length2
Mean length2.2047244
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중형
2nd row중형
3rd row소형
4th row중형
5th row중형

Common Values

ValueCountFrequency (%)
대형 2500
59.7%
중형 902
 
21.5%
<NA> 429
 
10.2%
소형 360
 
8.6%

Length

2023-12-13T02:52:19.753977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:52:19.945493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대형 2500
59.7%
중형 902
 
21.5%
na 429
 
10.2%
소형 360
 
8.6%

개수
Real number (ℝ)

MISSING 

Distinct33
Distinct (%)0.8%
Missing73
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean3.0271977
Minimum1
Maximum67
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size37.0 KiB
2023-12-13T02:52:20.105445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q34
95-th percentile9
Maximum67
Range66
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.3936514
Coefficient of variation (CV)1.1210538
Kurtosis54.56224
Mean3.0271977
Median Absolute Deviation (MAD)1
Skewness5.1928899
Sum12466
Variance11.51687
MonotonicityNot monotonic
2023-12-13T02:52:20.301039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1618
38.6%
2 879
21.0%
3 532
 
12.7%
4 347
 
8.3%
5 225
 
5.4%
6 141
 
3.4%
7 86
 
2.1%
8 72
 
1.7%
10 53
 
1.3%
9 43
 
1.0%
Other values (23) 122
 
2.9%
(Missing) 73
 
1.7%
ValueCountFrequency (%)
1 1618
38.6%
2 879
21.0%
3 532
 
12.7%
4 347
 
8.3%
5 225
 
5.4%
6 141
 
3.4%
7 86
 
2.1%
8 72
 
1.7%
9 43
 
1.0%
10 53
 
1.3%
ValueCountFrequency (%)
67 1
< 0.1%
50 1
< 0.1%
37 1
< 0.1%
35 1
< 0.1%
33 1
< 0.1%
32 2
< 0.1%
30 1
< 0.1%
28 1
< 0.1%
27 1
< 0.1%
26 1
< 0.1%

행정동
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
고창읍
4191 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고창읍
2nd row고창읍
3rd row고창읍
4th row고창읍
5th row고창읍

Common Values

ValueCountFrequency (%)
고창읍 4191
100.0%

Length

2023-12-13T02:52:20.489247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:52:20.627382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고창읍 4191
100.0%

관리기관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
환경미화팀
4191 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환경미화팀
2nd row환경미화팀
3rd row환경미화팀
4th row환경미화팀
5th row환경미화팀

Common Values

ValueCountFrequency (%)
환경미화팀 4191
100.0%

Length

2023-12-13T02:52:20.762837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:52:20.930418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경미화팀 4191
100.0%
Distinct1354
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Memory size32.9 KiB
2023-12-13T02:52:21.295822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters50292
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1353 ?
Unique (%)32.3%

Sample

1st row063-560-8109
2nd row063-560-8109
3rd row063-560-8109
4th row063-560-8109
5th row063-560-8109
ValueCountFrequency (%)
063-560-8109 2838
67.7%
063-560-9014 1
 
< 0.1%
063-560-9022 1
 
< 0.1%
063-560-9021 1
 
< 0.1%
063-560-9020 1
 
< 0.1%
063-560-9019 1
 
< 0.1%
063-560-9018 1
 
< 0.1%
063-560-9017 1
 
< 0.1%
063-560-9015 1
 
< 0.1%
063-560-9047 1
 
< 0.1%
Other values (1344) 1344
32.1%
2023-12-13T02:52:21.968166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11584
23.0%
6 8757
17.4%
- 8382
16.7%
3 4667
9.3%
5 4561
 
9.1%
8 4084
 
8.1%
9 3671
 
7.3%
1 3303
 
6.6%
2 476
 
0.9%
4 445
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 41910
83.3%
Dash Punctuation 8382
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11584
27.6%
6 8757
20.9%
3 4667
11.1%
5 4561
 
10.9%
8 4084
 
9.7%
9 3671
 
8.8%
1 3303
 
7.9%
2 476
 
1.1%
4 445
 
1.1%
7 362
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
- 8382
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 50292
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11584
23.0%
6 8757
17.4%
- 8382
16.7%
3 4667
9.3%
5 4561
 
9.1%
8 4084
 
8.1%
9 3671
 
7.3%
1 3303
 
6.6%
2 476
 
0.9%
4 445
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50292
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11584
23.0%
6 8757
17.4%
- 8382
16.7%
3 4667
9.3%
5 4561
 
9.1%
8 4084
 
8.1%
9 3671
 
7.3%
1 3303
 
6.6%
2 476
 
0.9%
4 445
 
0.9%

Interactions

2023-12-13T02:52:16.863828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:52:22.155084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분폐기물 규격개수
폐기물 구분1.0000.2110.000
폐기물 규격0.2111.0000.139
개수0.0000.1391.000
2023-12-13T02:52:22.322987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분폐기물 규격
폐기물 구분1.0000.201
폐기물 규격0.2011.000
2023-12-13T02:52:22.506516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개수폐기물 구분폐기물 규격
개수1.0000.0000.088
폐기물 구분0.0001.0000.201
폐기물 규격0.0880.2011.000

Missing values

2023-12-13T02:52:17.011806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:52:17.489722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:52:17.608624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

수거일자폐기물 구분폐기물 명폐기물 규격개수행정동관리기관전화번호
02021-01-22기타의자2+ 상3+ 옷걸이+ 돗자리 등중형8고창읍환경미화팀063-560-8109
12021-01-22가구류싱글침대1+ CD장1+ 책장2+ 책상1+ 쇼파베드1중형6고창읍환경미화팀063-560-8109
22021-01-22기타전기장판1소형1고창읍환경미화팀063-560-8109
32021-01-25가구책상1+ 쇼파1+ 침대 프레임1중형3고창읍환경미화팀063-560-8109
42021-01-25기타침대매트리스2중형2고창읍환경미화팀063-560-8109
52021-01-25기타매트리스1소형1고창읍환경미화팀063-560-8109
62021-01-25가구류책상1소형1고창읍환경미화팀063-560-8109
72021-01-25가구류선풍기1+ 식탁테이블1+ 의자2+ 선반1 등중형5고창읍환경미화팀063-560-8109
82021-01-25가구류책장2+ 책상1중형3고창읍환경미화팀063-560-8109
92021-01-25가구류쇼파1중형1고창읍환경미화팀063-560-8109
수거일자폐기물 구분폐기물 명폐기물 규격개수행정동관리기관전화번호
41812023-08-21가구류침대(분해)1대형1고창읍환경미화팀063-560-9461
41822023-08-21기타음식물쓰레기통(60L)1대형1고창읍환경미화팀063-560-9462
41832023-08-21기타거울틀소형1고창읍환경미화팀063-560-9463
41842023-08-21가구류매트1+ 쇼파1+ 매트리스1대형3고창읍환경미화팀063-560-9464
41852023-08-21가구류가구류(책상 등)대형1고창읍환경미화팀063-560-9465
41862023-08-21기타대자리2+ 밥상1대형3고창읍환경미화팀063-560-9466
41872023-08-21가구류쇼파1대형1고창읍환경미화팀063-560-9467
41882023-08-21가구류쇼파1+ 매트리스1+ 의자1대형3고창읍환경미화팀063-560-9468
41892023-08-21가구류쇼파1+ 매트1대형2고창읍환경미화팀063-560-9469
41902023-08-21가구류나무침대1+ 판넬 다수대형2고창읍환경미화팀063-560-9470

Duplicate rows

Most frequently occurring

수거일자폐기물 구분폐기물 명폐기물 규격개수행정동관리기관전화번호# duplicates
122021-06-21가구류쇼파1중형1고창읍환경미화팀063-560-81094
532022-11-21가구류소파1대형1고창읍환경미화팀063-560-81093
02021-02-22가구류쇼파1대형1고창읍환경미화팀063-560-81092
12021-03-22가구류침대세트(프레임+ 매트리스)2중형4고창읍환경미화팀063-560-81092
22021-03-24기타매트리스2중형2고창읍환경미화팀063-560-81092
32021-03-29가구류쇼파1대형1고창읍환경미화팀063-560-81092
42021-03-30기타매트리스1중형1고창읍환경미화팀063-560-81092
52021-04-02기타씽크대1<NA>1고창읍환경미화팀063-560-81092
62021-04-15가구류쇼파1중형1고창읍환경미화팀063-560-81092
72021-05-06기타변기1<NA>1고창읍환경미화팀063-560-81092