Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows20
Duplicate rows (%)0.2%
Total size in memory878.9 KiB
Average record size in memory90.0 B

Variable types

Categorical5
DateTime2
Text1
Numeric2

Dataset

Description충청북도 충주시 대형폐기물수거현황에 대한 데이터 제공(자치단체, 수거일자, 민원인명, 수거지역, 차량, 수거량, 금액, 담당부서, 문의전화, 기준일자)
URLhttps://www.data.go.kr/data/15042189/fileData.do

Alerts

자치단체 has constant value ""Constant
담당부서 has constant value ""Constant
문의전화 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 20 (0.2%) duplicate rowsDuplicates
수거량 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 수거량High correlation
수거지역 is highly overall correlated with 차량High correlation
차량 is highly overall correlated with 수거지역High correlation

Reproduction

Analysis started2023-12-12 16:16:44.889822
Analysis finished2023-12-12 16:16:46.532136
Duration1.64 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치단체
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청북도 충주시
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청북도 충주시
2nd row충청북도 충주시
3rd row충청북도 충주시
4th row충청북도 충주시
5th row충청북도 충주시

Common Values

ValueCountFrequency (%)
충청북도 충주시 10000
100.0%

Length

2023-12-13T01:16:46.618252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:16:46.735478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청북도 10000
50.0%
충주시 10000
50.0%
Distinct298
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-07-02 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T01:16:46.872021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:16:47.033608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct7016
Distinct (%)70.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T01:16:47.420443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length3
Mean length4.5349
Min length2

Characters and Unicode

Total characters45349
Distinct characters423
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5552 ?
Unique (%)55.5%

Sample

1st row엄미정
2nd row이윤희/추가건
3rd row경비실
4th row권지미
5th row이미숙
ValueCountFrequency (%)
클린 155
 
1.5%
경비실 124
 
1.2%
다량-90마0400 25
 
0.2%
새한목행관리사무소(권승옥 18
 
0.2%
신우희가로apt 17
 
0.2%
다량-95어4220 15
 
0.1%
다량-85부9648 15
 
0.1%
다량-90더7663 15
 
0.1%
다량-87라6448 14
 
0.1%
다량-97두4730 13
 
0.1%
Other values (7030) 9634
95.9%
2023-12-13T01:16:47.958519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1992
 
4.4%
- 1981
 
4.4%
1877
 
4.1%
8 1782
 
3.9%
1564
 
3.4%
9 1515
 
3.3%
1345
 
3.0%
2 1163
 
2.6%
4 1113
 
2.5%
0 1108
 
2.4%
Other values (413) 29909
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31464
69.4%
Decimal Number 11403
 
25.1%
Dash Punctuation 1981
 
4.4%
Other Punctuation 189
 
0.4%
Close Punctuation 77
 
0.2%
Open Punctuation 74
 
0.2%
Lowercase Letter 59
 
0.1%
Uppercase Letter 57
 
0.1%
Space Separator 45
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1992
 
6.3%
1877
 
6.0%
1564
 
5.0%
1345
 
4.3%
926
 
2.9%
779
 
2.5%
615
 
2.0%
614
 
2.0%
599
 
1.9%
500
 
1.6%
Other values (362) 20653
65.6%
Uppercase Letter
ValueCountFrequency (%)
A 9
15.8%
T 6
10.5%
N 5
8.8%
J 4
 
7.0%
P 4
 
7.0%
I 4
 
7.0%
H 4
 
7.0%
G 3
 
5.3%
E 3
 
5.3%
S 3
 
5.3%
Other values (9) 12
21.1%
Lowercase Letter
ValueCountFrequency (%)
t 16
27.1%
p 15
25.4%
a 15
25.4%
e 3
 
5.1%
s 2
 
3.4%
k 1
 
1.7%
h 1
 
1.7%
o 1
 
1.7%
n 1
 
1.7%
y 1
 
1.7%
Other values (3) 3
 
5.1%
Decimal Number
ValueCountFrequency (%)
8 1782
15.6%
9 1515
13.3%
2 1163
10.2%
4 1113
9.8%
0 1108
9.7%
1 1070
9.4%
5 970
8.5%
3 938
8.2%
7 936
8.2%
6 808
7.1%
Other Punctuation
ValueCountFrequency (%)
/ 165
87.3%
* 17
 
9.0%
, 5
 
2.6%
. 2
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 74
96.1%
] 3
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 1981
100.0%
Open Punctuation
ValueCountFrequency (%)
( 74
100.0%
Space Separator
ValueCountFrequency (%)
45
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31464
69.4%
Common 13769
30.4%
Latin 116
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1992
 
6.3%
1877
 
6.0%
1564
 
5.0%
1345
 
4.3%
926
 
2.9%
779
 
2.5%
615
 
2.0%
614
 
2.0%
599
 
1.9%
500
 
1.6%
Other values (362) 20653
65.6%
Latin
ValueCountFrequency (%)
t 16
13.8%
p 15
12.9%
a 15
12.9%
A 9
 
7.8%
T 6
 
5.2%
N 5
 
4.3%
J 4
 
3.4%
P 4
 
3.4%
I 4
 
3.4%
H 4
 
3.4%
Other values (22) 34
29.3%
Common
ValueCountFrequency (%)
- 1981
14.4%
8 1782
12.9%
9 1515
11.0%
2 1163
8.4%
4 1113
8.1%
0 1108
8.0%
1 1070
7.8%
5 970
7.0%
3 938
6.8%
7 936
6.8%
Other values (9) 1193
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31464
69.4%
ASCII 13885
30.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1992
 
6.3%
1877
 
6.0%
1564
 
5.0%
1345
 
4.3%
926
 
2.9%
779
 
2.5%
615
 
2.0%
614
 
2.0%
599
 
1.9%
500
 
1.6%
Other values (362) 20653
65.6%
ASCII
ValueCountFrequency (%)
- 1981
14.3%
8 1782
12.8%
9 1515
10.9%
2 1163
8.4%
4 1113
8.0%
0 1108
8.0%
1 1070
7.7%
5 970
7.0%
3 938
6.8%
7 936
6.7%
Other values (41) 1309
9.4%

수거지역
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
칠금.금릉동
2745 
연수동
1360 
교현.안림동
955 
중앙탑면
747 
호암.직동
694 
Other values (21)
3499 

Length

Max length6
Median length5
Mean length4.5138
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row호암.직동
2nd row칠금.금릉동
3rd row교현.안림동
4th row문화동
5th row칠금.금릉동

Common Values

ValueCountFrequency (%)
칠금.금릉동 2745
27.5%
연수동 1360
13.6%
교현.안림동 955
 
9.6%
중앙탑면 747
 
7.5%
호암.직동 694
 
6.9%
용산동 628
 
6.3%
대소원면 433
 
4.3%
문화동 338
 
3.4%
봉방동 278
 
2.8%
목행.용탄동 263
 
2.6%
Other values (16) 1559
15.6%

Length

2023-12-13T01:16:48.148518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
칠금.금릉동 2745
27.5%
연수동 1360
13.6%
교현.안림동 955
 
9.6%
중앙탑면 747
 
7.5%
호암.직동 694
 
6.9%
용산동 628
 
6.3%
대소원면 433
 
4.3%
문화동 338
 
3.4%
봉방동 278
 
2.8%
목행.용탄동 263
 
2.6%
Other values (16) 1559
15.6%

차량
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
차량2
1555 
차량3
1464 
차량1
1420 
차량4
1372 
차량6
1160 
Other values (15)
3029 

Length

Max length4
Median length3
Mean length3.0062
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row차량2
2nd row차량3
3rd row차량1
4th row차량1
5th row차량1

Common Values

ValueCountFrequency (%)
차량2 1555
15.6%
차량3 1464
14.6%
차량1 1420
14.2%
차량4 1372
13.7%
차량6 1160
11.6%
차량5 860
8.6%
운영3 593
 
5.9%
운영2 525
 
5.2%
운영1 501
 
5.0%
운영4 480
 
4.8%
Other values (10) 70
 
0.7%

Length

2023-12-13T01:16:48.282960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
차량2 1555
15.6%
차량3 1464
14.6%
차량1 1420
14.2%
차량4 1372
13.7%
차량6 1160
11.6%
차량5 860
8.6%
운영3 593
 
5.9%
운영2 525
 
5.2%
운영1 501
 
5.0%
운영4 480
 
4.8%
Other values (10) 70
 
0.7%

수거량
Real number (ℝ)

HIGH CORRELATION 

Distinct38
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4949
Minimum0
Maximum54
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:16:48.433091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q35
95-th percentile11
Maximum54
Range54
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.7377227
Coefficient of variation (CV)1.0694792
Kurtosis14.774184
Mean3.4949
Median Absolute Deviation (MAD)1
Skewness2.8055242
Sum34949
Variance13.970571
MonotonicityNot monotonic
2023-12-13T01:16:48.591141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
1 4217
42.2%
2 1688
16.9%
3 873
 
8.7%
4 623
 
6.2%
5 476
 
4.8%
6 422
 
4.2%
7 372
 
3.7%
8 283
 
2.8%
9 282
 
2.8%
10 231
 
2.3%
Other values (28) 533
 
5.3%
ValueCountFrequency (%)
0 2
 
< 0.1%
1 4217
42.2%
2 1688
16.9%
3 873
 
8.7%
4 623
 
6.2%
5 476
 
4.8%
6 422
 
4.2%
7 372
 
3.7%
8 283
 
2.8%
9 282
 
2.8%
ValueCountFrequency (%)
54 1
 
< 0.1%
48 1
 
< 0.1%
42 1
 
< 0.1%
37 2
< 0.1%
34 1
 
< 0.1%
33 3
< 0.1%
32 1
 
< 0.1%
31 1
 
< 0.1%
30 4
< 0.1%
28 2
< 0.1%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct230
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19904.45
Minimum0
Maximum708000
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:16:48.754459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2000
Q14000
median8000
Q320000
95-th percentile80000
Maximum708000
Range708000
Interquartile range (IQR)16000

Descriptive statistics

Standard deviation32230.389
Coefficient of variation (CV)1.6192554
Kurtosis49.614738
Mean19904.45
Median Absolute Deviation (MAD)5000
Skewness5.0535671
Sum1.990445 × 108
Variance1.038798 × 109
MonotonicityNot monotonic
2023-12-13T01:16:48.907121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3000 1137
 
11.4%
6000 828
 
8.3%
4000 822
 
8.2%
2000 755
 
7.5%
5000 731
 
7.3%
8000 664
 
6.6%
10000 518
 
5.2%
7000 272
 
2.7%
9000 229
 
2.3%
15000 229
 
2.3%
Other values (220) 3815
38.1%
ValueCountFrequency (%)
0 2
 
< 0.1%
1000 11
 
0.1%
2000 755
7.5%
3000 1137
11.4%
4000 822
8.2%
5000 731
7.3%
6000 828
8.3%
6500 2
 
< 0.1%
7000 272
 
2.7%
7500 1
 
< 0.1%
ValueCountFrequency (%)
708000 1
< 0.1%
505000 1
< 0.1%
500000 1
< 0.1%
410000 1
< 0.1%
401000 1
< 0.1%
385000 1
< 0.1%
323000 1
< 0.1%
311000 1
< 0.1%
310000 2
< 0.1%
300000 1
< 0.1%

담당부서
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충주시청 자원순환과
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충주시청 자원순환과
2nd row충주시청 자원순환과
3rd row충주시청 자원순환과
4th row충주시청 자원순환과
5th row충주시청 자원순환과

Common Values

ValueCountFrequency (%)
충주시청 자원순환과 10000
100.0%

Length

2023-12-13T01:16:49.062568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:16:49.159532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충주시청 10000
50.0%
자원순환과 10000
50.0%

문의전화
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
043-850-6991
10000 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row043-850-6991
2nd row043-850-6991
3rd row043-850-6991
4th row043-850-6991
5th row043-850-6991

Common Values

ValueCountFrequency (%)
043-850-6991 10000
100.0%

Length

2023-12-13T01:16:49.277506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:16:49.681390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
043-850-6991 10000
100.0%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-06-30 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T01:16:49.782729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:16:49.895091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T01:16:45.942045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:16:45.696823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:16:46.078649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:16:45.812831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:16:49.970051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수거지역차량수거량금액
수거지역1.0000.9150.3820.278
차량0.9151.0000.4570.351
수거량0.3820.4571.0000.384
금액0.2780.3510.3841.000
2023-12-13T01:16:50.069054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수거지역차량
수거지역1.0000.525
차량0.5251.000
2023-12-13T01:16:50.153582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수거량금액수거지역차량
수거량1.0000.8190.1440.161
금액0.8191.0000.1140.148
수거지역0.1440.1141.0000.525
차량0.1610.1480.5251.000

Missing values

2023-12-13T01:16:46.264242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:16:46.442976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치단체수거일자민원인명수거지역차량수거량금액담당부서문의전화기준일자
26454충청북도 충주시2023-01-16엄미정호암.직동차량236000충주시청 자원순환과043-850-69912023-06-30
9186충청북도 충주시2022-09-02이윤희/추가건칠금.금릉동차량3480000충주시청 자원순환과043-850-69912023-06-30
48322충청북도 충주시2023-06-08경비실교현.안림동차량1411000충주시청 자원순환과043-850-69912023-06-30
4471충청북도 충주시2022-08-03권지미문화동차량124000충주시청 자원순환과043-850-69912023-06-30
211충청북도 충주시2022-07-04이미숙칠금.금릉동차량112000충주시청 자원순환과043-850-69912023-06-30
50588충청북도 충주시2023-06-23다량-15부3329칠금.금릉동운영133000충주시청 자원순환과043-850-69912023-06-30
21214충청북도 충주시2022-11-26김향남교현.안림동차량4115000충주시청 자원순환과043-850-69912023-06-30
22155충청북도 충주시2022-12-05김은경연수동차량3211000충주시청 자원순환과043-850-69912023-06-30
24779충청북도 충주시2023-01-02조용석교현.안림동차량413000충주시청 자원순환과043-850-69912023-06-30
23114충청북도 충주시2022-12-14한만홍교현.안림동차량413000충주시청 자원순환과043-850-69912023-06-30
자치단체수거일자민원인명수거지역차량수거량금액담당부서문의전화기준일자
51201충청북도 충주시2023-06-27안광수지현동차량2629000충주시청 자원순환과043-850-69912023-06-30
6425충청북도 충주시2022-08-17박종명대소원면차량5312000충주시청 자원순환과043-850-69912023-06-30
24092충청북도 충주시2022-12-26이수연연수동차량324000충주시청 자원순환과043-850-69912023-06-30
2181충청북도 충주시2022-07-18주순이문화동차량115000충주시청 자원순환과043-850-69912023-06-30
37931충청북도 충주시2023-03-30김경원산척면차량615000충주시청 자원순환과043-850-69912023-06-30
43028충청북도 충주시2023-05-03채한규앙성면차량6854000충주시청 자원순환과043-850-69912023-06-30
10093충청북도 충주시2022-09-08최은진연수동차량3624000충주시청 자원순환과043-850-69912023-06-30
14573충청북도 충주시2022-10-08박은주교현.안림동차량413000충주시청 자원순환과043-850-69912023-06-30
39728충청북도 충주시2023-04-11김종례용산동차량2637000충주시청 자원순환과043-850-69912023-06-30
45432충청북도 충주시2023-05-19황민정연수동차량325000충주시청 자원순환과043-850-69912023-06-30

Duplicate rows

Most frequently occurring

자치단체수거일자민원인명수거지역차량수거량금액담당부서문의전화기준일자# duplicates
14충청북도 충주시2023-05-24클린칠금.금릉동운영315000충주시청 자원순환과043-850-69912023-06-303
0충청북도 충주시2022-07-14장인영연수동차량3313000충주시청 자원순환과043-850-69912023-06-302
1충청북도 충주시2022-07-21경비실연수동차량315000충주시청 자원순환과043-850-69912023-06-302
2충청북도 충주시2022-07-26이상희목행.용탄동차량415000충주시청 자원순환과043-850-69912023-06-302
3충청북도 충주시2022-10-04김성욱대소원면차량512000충주시청 자원순환과043-850-69912023-06-302
4충청북도 충주시2022-10-29김수경연수동차량312000충주시청 자원순환과043-850-69912023-06-302
5충청북도 충주시2022-11-10김윤정칠금.금릉동차량114000충주시청 자원순환과043-850-69912023-06-302
6충청북도 충주시2023-01-12김은미봉방동차량116000충주시청 자원순환과043-850-69912023-06-302
7충청북도 충주시2023-03-14정태윤용산동차량225000충주시청 자원순환과043-850-69912023-06-302
8충청북도 충주시2023-03-20문미영교현2동차량4211000충주시청 자원순환과043-850-69912023-06-302