Overview

Dataset statistics

Number of variables: 4
Number of observations: 1398
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 43.8 KiB
Average record size in memory: 32.1 B
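
The average record size follows from two of the figures in this table: total memory divided by the number of observations. A minimal check, assuming the report uses binary units (1 KiB = 1024 bytes):

```python
# Figures copied from the "Dataset statistics" table.
total_size_kib = 43.8    # "Total size in memory"
n_observations = 1398    # "Number of observations"

# Assumption: 1 KiB = 1024 bytes, so convert before dividing by the row count.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "32.1 B"
```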

Variable types

Text: 2
Categorical: 2

Dataset

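Description: Status of volume-rate waste bag (종량제) sales outlets, provided by the Cheonan City Facilities Management Corporation (천안시시설관리공단). The data consists of address, location, delivery day, and related fields.
Author: 천안시시설관리공단
URL: https://www.data.go.kr/data/15095346/fileData.do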

Alerts

구역 (zone) is highly overall correlated with 배송요일 (delivery day): High correlation
배송요일 (delivery day) is highly overall correlated with 구역 (zone): High correlation

Reproduction

Analysis started: 2024-04-21 02:04:48.369976
Analysis finished: 2024-04-21 02:04:50.013097
Duration: 1.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

상호명
Text

Distinct: 1289
Distinct (%): 92.2%
Missing: 0
Missing (%): 0.0%
Memory size: 11.1 KiB

Length

Max length: 23
Median length: 19
Mean length: 8.4728183
Min length: 2

Characters and Unicode

Total characters: 11845
Distinct characters: 447
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
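
As a rough illustration of how such category counts can be derived, the sketch below uses Python's standard `unicodedata` module on the first sample value of this variable. It is not ydata-profiling's own implementation; the helper name `category_counts` is made up for this example.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value of this variable, as listed in the report.
sample = "GS25 온누리점"
counts = category_counts(sample)

# Lu = Uppercase Letter, Nd = Decimal Number,
# Zs = Space Separator, Lo = Other Letter (Hangul syllables fall here).
for code, n in sorted(counts.items()):
    print(code, n)
```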

Unique

Unique: 1221
Unique (%): 87.3%
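
The report distinguishes distinct values (how many different values occur) from unique values (values that occur exactly once). A small sketch of that distinction follows, using a hypothetical toy list and a helper name (`distinct_and_unique`) that is not part of ydata-profiling. The last lines recompute the two percentages from the counts reported for this variable and the 1398 observations.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical toy data: "A" repeats, "B" and "C" occur once each.
toy = ["A", "A", "B", "C"]
toy_distinct, toy_unique = distinct_and_unique(toy)

# Percentages recomputed from the counts reported for this variable.
n_rows = 1398
distinct_pct = round(100 * 1289 / n_rows, 1)
unique_pct = round(100 * 1221 / n_rows, 1)
print(toy_distinct, toy_unique, distinct_pct, unique_pct)
```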

Sample

1st row: GS25 온누리점
2nd row: 광덕공판장
3rd row: 광덕상회
4th row: 농원가든
5th row: 농협 하나로마트 광덕지점

Value | Count | Frequency (%)
gs25 | 283 | 11.4%
씨유 | 268 | 10.8%
세븐일레븐 | 203 | 8.2%
이마트24 | 87 | 3.5%
주)코리아세븐 | 23 | 0.9%
중앙점 | 19 | 0.8%
타운점 | 15 | 0.6%
불당점 | 14 | 0.6%
쌍용스토아 | 13 | 0.5%
농가마트 | 11 | 0.4%
Other values (1163) | 1538 | 62.2%
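
The lowercase `gs25` entry suggests that the word counts are computed on lowercased, whitespace-separated tokens. That is an assumption, since the report does not state its tokenization rules. Under that assumption, a sketch over the five sample rows could look like this (`word_frequencies` is an illustrative name):

```python
from collections import Counter


def word_frequencies(values):
    """Count lowercased, whitespace-separated tokens across all values."""
    words = Counter()
    for value in values:
        # Assumption: tokens are split on whitespace only and lowercased.
        words.update(value.lower().split())
    return words


# The five sample rows listed for this variable.
names = [
    "GS25 온누리점",
    "광덕공판장",
    "광덕상회",
    "농원가든",
    "농협 하나로마트 광덕지점",
]
freq = word_frequencies(names)
total = sum(freq.values())

# Print each token with its count and its share of all tokens.
for word, count in freq.most_common():
    print(word, count, f"{100 * count / total:.1f}%")
```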

Most occurring characters

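Value | Count | Frequency (%)
(space) | 1084 | 9.2%
(glyph missing) | 972 | 8.2%
(glyph missing) | 439 | 3.7%
2 | 396 | 3.3%
(glyph missing) | 338 | 2.9%
(glyph missing) | 318 | 2.7%
(glyph missing) | 308 | 2.6%
5 | 294 | 2.5%
S | 287 | 2.4%
G | 286 | 2.4%
Other values (437) | 7123 | 60.1%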

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9111 | 76.9%
Space Separator | 1084 | 9.2%
Decimal Number | 809 | 6.8%
Uppercase Letter | 632 | 5.3%
Open Punctuation | 96 | 0.8%
Close Punctuation | 96 | 0.8%
Lowercase Letter | 13 | 0.1%
Dash Punctuation | 3 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph missing) | 972 | 10.7%
(glyph missing) | 439 | 4.8%
(glyph missing) | 338 | 3.7%
(glyph missing) | 318 | 3.5%
(glyph missing) | 308 | 3.4%
(glyph missing) | 279 | 3.1%
(glyph missing) | 252 | 2.8%
(glyph missing) | 241 | 2.6%
(glyph missing) | 225 | 2.5%
(glyph missing) | 223 | 2.4%
Other values (393) | 5516 | 60.5%

Uppercase Letter
Value | Count | Frequency (%)
S | 287 | 45.4%
G | 286 | 45.3%
C | 11 | 1.7%
I | 6 | 0.9%
M | 6 | 0.9%
L | 5 | 0.8%
R | 4 | 0.6%
K | 4 | 0.6%
D | 3 | 0.5%
H | 3 | 0.5%
Other values (9) | 17 | 2.7%

Decimal Number
Value | Count | Frequency (%)
2 | 396 | 48.9%
5 | 294 | 36.3%
4 | 100 | 12.4%
1 | 8 | 1.0%
6 | 3 | 0.4%
3 | 3 | 0.4%
8 | 2 | 0.2%
7 | 1 | 0.1%
0 | 1 | 0.1%
9 | 1 | 0.1%

Lowercase Letter
Value | Count | Frequency (%)
t | 2 | 15.4%
r | 2 | 15.4%
a | 2 | 15.4%
o | 1 | 7.7%
e | 1 | 7.7%
c | 1 | 7.7%
i | 1 | 7.7%
j | 1 | 7.7%
m | 1 | 7.7%
s | 1 | 7.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1084 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 96 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 96 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9111 | 76.9%
Common | 2089 | 17.6%
Latin | 645 | 5.4%

Most frequent character per script

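Hangul
Value | Count | Frequency (%)
(glyph missing) | 972 | 10.7%
(glyph missing) | 439 | 4.8%
(glyph missing) | 338 | 3.7%
(glyph missing) | 318 | 3.5%
(glyph missing) | 308 | 3.4%
(glyph missing) | 279 | 3.1%
(glyph missing) | 252 | 2.8%
(glyph missing) | 241 | 2.6%
(glyph missing) | 225 | 2.5%
(glyph missing) | 223 | 2.4%
Other values (393) | 5516 | 60.5%

Latin
Value | Count | Frequency (%)
S | 287 | 44.5%
G | 286 | 44.3%
C | 11 | 1.7%
I | 6 | 0.9%
M | 6 | 0.9%
L | 5 | 0.8%
R | 4 | 0.6%
K | 4 | 0.6%
D | 3 | 0.5%
H | 3 | 0.5%
Other values (19) | 30 | 4.7%

Common
Value | Count | Frequency (%)
(space) | 1084 | 51.9%
2 | 396 | 19.0%
5 | 294 | 14.1%
4 | 100 | 4.8%
( | 96 | 4.6%
) | 96 | 4.6%
1 | 8 | 0.4%
6 | 3 | 0.1%
3 | 3 | 0.1%
- | 3 | 0.1%
Other values (5) | 6 | 0.3%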

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9111 | 76.9%
ASCII | 2734 | 23.1%

Most frequent character per block

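ASCII
Value | Count | Frequency (%)
(space) | 1084 | 39.6%
2 | 396 | 14.5%
5 | 294 | 10.8%
S | 287 | 10.5%
G | 286 | 10.5%
4 | 100 | 3.7%
( | 96 | 3.5%
) | 96 | 3.5%
C | 11 | 0.4%
1 | 8 | 0.3%
Other values (34) | 76 | 2.8%

Hangul
Value | Count | Frequency (%)
(glyph missing) | 972 | 10.7%
(glyph missing) | 439 | 4.8%
(glyph missing) | 338 | 3.7%
(glyph missing) | 318 | 3.5%
(glyph missing) | 308 | 3.4%
(glyph missing) | 279 | 3.1%
(glyph missing) | 252 | 2.8%
(glyph missing) | 241 | 2.6%
(glyph missing) | 225 | 2.5%
(glyph missing) | 223 | 2.4%
Other values (393) | 5516 | 60.5%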

구역
Categorical

HIGH CORRELATION 

Distinct: 44
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.1 KiB

두정: 132
성정: 130
불당: 96
쌍용: 94
신부: 76
Other values (39): 870

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 3
Unique (%): 0.2%

Sample

1st row 광덕
2nd row 광덕
3rd row 광덕
4th row 광덕
5th row 광덕

Common Values

Value | Count | Frequency (%)
두정 | 132 | 9.4%
성정 | 130 | 9.3%
불당 | 96 | 6.9%
쌍용 | 94 | 6.7%
신부 | 76 | 5.4%
신방 | 74 | 5.3%
성환 | 67 | 4.8%
직산 | 67 | 4.8%
백석 | 54 | 3.9%
봉명 | 54 | 3.9%
Other values (34) | 554 | 39.6%

Length

Histogram of lengths of the category

배송요일
Categorical

HIGH CORRELATION 

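Distinct: 7
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 11.1 KiB

(glyph missing): 328
(glyph missing): 309
(glyph missing): 271
(glyph missing): 267
(glyph missing): 216
Other values (2): 7

Length

Max length: 4
Median length: 1
Mean length: 1.0064378
Min length: 1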
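
The mean length can be cross-checked against the value counts reported for this variable. The sketch assumes that the five most frequent values are single characters (consistent with a minimum and median length of 1) and that the missing entry is counted as the four-character string `<NA>` (consistent with the maximum length of 4).

```python
# Counts taken from the "Common Values" table of this variable.
# Assumption: the five most frequent values are one character long each.
single_char_counts = [328, 309, 271, 267, 216]

# (value length, number of rows) pairs.
lengths_and_counts = [(1, n) for n in single_char_counts]
lengths_and_counts.append((len("현판"), 6))   # two-character value, 6 rows
lengths_and_counts.append((len("<NA>"), 1))  # assumed string form of the missing entry

total_chars = sum(length * n for length, n in lengths_and_counts)
total_rows = sum(n for _, n in lengths_and_counts)
mean_length = total_chars / total_rows

print(total_rows, f"{mean_length:.7f}")  # prints "1398 1.0064378"
```

Under these assumptions the result agrees with both the reported number of observations and the reported mean length.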

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

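Value | Count | Frequency (%)
(glyph missing) | 328 | 23.5%
(glyph missing) | 309 | 22.1%
(glyph missing) | 271 | 19.4%
(glyph missing) | 267 | 19.1%
(glyph missing) | 216 | 15.5%
현판 | 6 | 0.4%
<NA> | 1 | 0.1%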

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar chart of the common values of 배송요일]

도로명주소
Text

Distinct: 1380
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 11.1 KiB

Length

Max length: 56
Median length: 45
Mean length: 25.003577
Min length: 13

Characters and Unicode

Total characters: 34955
Distinct characters: 449
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
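
Blocks are contiguous code point ranges, so a character's block can be looked up from its code point, while its general category comes from `unicodedata`. The sketch below hard-codes three ranges taken from the Unicode Standard. The table `BLOCKS`, the helper `block_of`, and the Roman numeral `Ⅱ` are illustrative assumptions; the report does not show which character fell into Number Forms.

```python
import unicodedata

# A few block ranges from the Unicode Standard (inclusive code points).
# The report labels Basic Latin as "ASCII" and Hangul Syllables as "Hangul".
BLOCKS = [
    ("ASCII", 0x0000, 0x007F),
    ("Number Forms", 0x2150, 0x218F),
    ("Hangul Syllables", 0xAC00, 0xD7AF),
]


def block_of(ch):
    """Return the name of the block containing ch, or None if not listed."""
    cp = ord(ch)
    for name, lo, hi in BLOCKS:
        if lo <= cp <= hi:
            return name
    return None


# "Ⅱ" (U+2161) is a hypothetical example of a Letter Number character.
for ch in ["천", "1", "(", "Ⅱ"]:
    print(repr(ch), unicodedata.category(ch), block_of(ch))
```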

Unique

Unique: 1364
Unique (%): 97.6%

Sample

1st row: 천안시 동남구 광덕면 광풍로905(광덕부동산)
2nd row: 천안시 동남구 신흥리4길 49
3rd row: 천안시 동남구 광덕면 행정길 1
4th row: 천안시 동남구 광덕면 해사동길 15 (광덕사관광농원가든)
5th row: 천안시 동남구 신흥리3길 25

Value | Count | Frequency (%)
천안시 | 1390 | 19.4%
서북구 | 780 | 10.9%
동남구 | 608 | 8.5%
충청남도 | 129 | 1.8%
두정동 | 72 | 1.0%
성정동 | 62 | 0.9%
직산읍 | 58 | 0.8%
불당동 | 55 | 0.8%
성환읍 | 53 | 0.7%
목천읍 | 46 | 0.6%
Other values (1889) | 3896 | 54.5%

Most occurring characters

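Value | Count | Frequency (%)
(space) | 6586 | 18.8%
(glyph missing) | 1617 | 4.6%
(glyph missing) | 1569 | 4.5%
(glyph missing) | 1479 | 4.2%
(glyph missing) | 1448 | 4.1%
(glyph missing) | 1435 | 4.1%
1 | 1306 | 3.7%
( | 948 | 2.7%
) | 946 | 2.7%
(glyph missing) | 909 | 2.6%
Other values (439) | 16712 | 47.8%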

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 20690 | 59.2%
Space Separator | 6586 | 18.8%
Decimal Number | 5012 | 14.3%
Open Punctuation | 948 | 2.7%
Close Punctuation | 946 | 2.7%
Other Punctuation | 426 | 1.2%
Dash Punctuation | 259 | 0.7%
Uppercase Letter | 72 | 0.2%
Lowercase Letter | 11 | < 0.1%
Math Symbol | 3 | < 0.1%
Other values (2) | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph missing) | 1617 | 7.8%
(glyph missing) | 1569 | 7.6%
(glyph missing) | 1479 | 7.1%
(glyph missing) | 1448 | 7.0%
(glyph missing) | 1435 | 6.9%
(glyph missing) | 909 | 4.4%
(glyph missing) | 799 | 3.9%
(glyph missing) | 778 | 3.8%
(glyph missing) | 722 | 3.5%
(glyph missing) | 680 | 3.3%
Other values (391) | 9254 | 44.7%

Uppercase Letter
Value | Count | Frequency (%)
A | 10 | 13.9%
S | 10 | 13.9%
G | 9 | 12.5%
C | 6 | 8.3%
T | 5 | 6.9%
L | 4 | 5.6%
B | 3 | 4.2%
P | 3 | 4.2%
E | 3 | 4.2%
M | 3 | 4.2%
Other values (12) | 16 | 22.2%

Decimal Number
Value | Count | Frequency (%)
1 | 1306 | 26.1%
2 | 742 | 14.8%
3 | 557 | 11.1%
4 | 435 | 8.7%
5 | 379 | 7.6%
0 | 374 | 7.5%
6 | 335 | 6.7%
7 | 313 | 6.2%
8 | 307 | 6.1%
9 | 264 | 5.3%

Lowercase Letter
Value | Count | Frequency (%)
e | 6 | 54.5%
b | 2 | 18.2%
c | 1 | 9.1%
s | 1 | 9.1%
f | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
, | 416 | 97.7%
. | 7 | 1.6%
@ | 2 | 0.5%
/ | 1 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 6586 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 948 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 946 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 259 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(glyph missing) | 1 | 100.0%

Letter Number
Value | Count | Frequency (%)
(glyph missing) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 20691 | 59.2%
Common | 14180 | 40.6%
Latin | 84 | 0.2%

Most frequent character per script

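Hangul
Value | Count | Frequency (%)
(glyph missing) | 1617 | 7.8%
(glyph missing) | 1569 | 7.6%
(glyph missing) | 1479 | 7.1%
(glyph missing) | 1448 | 7.0%
(glyph missing) | 1435 | 6.9%
(glyph missing) | 909 | 4.4%
(glyph missing) | 799 | 3.9%
(glyph missing) | 778 | 3.8%
(glyph missing) | 722 | 3.5%
(glyph missing) | 680 | 3.3%
Other values (392) | 9255 | 44.7%

Latin
Value | Count | Frequency (%)
A | 10 | 11.9%
S | 10 | 11.9%
G | 9 | 10.7%
e | 6 | 7.1%
C | 6 | 7.1%
T | 5 | 6.0%
L | 4 | 4.8%
B | 3 | 3.6%
P | 3 | 3.6%
E | 3 | 3.6%
Other values (18) | 25 | 29.8%

Common
Value | Count | Frequency (%)
(space) | 6586 | 46.4%
1 | 1306 | 9.2%
( | 948 | 6.7%
) | 946 | 6.7%
2 | 742 | 5.2%
3 | 557 | 3.9%
4 | 435 | 3.1%
, | 416 | 2.9%
5 | 379 | 2.7%
0 | 374 | 2.6%
Other values (9) | 1491 | 10.5%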

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 20690 | 59.2%
ASCII | 14263 | 40.8%
None | 1 | < 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

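ASCII
Value | Count | Frequency (%)
(space) | 6586 | 46.2%
1 | 1306 | 9.2%
( | 948 | 6.6%
) | 946 | 6.6%
2 | 742 | 5.2%
3 | 557 | 3.9%
4 | 435 | 3.0%
, | 416 | 2.9%
5 | 379 | 2.7%
0 | 374 | 2.6%
Other values (36) | 1574 | 11.0%

Hangul
Value | Count | Frequency (%)
(glyph missing) | 1617 | 7.8%
(glyph missing) | 1569 | 7.6%
(glyph missing) | 1479 | 7.1%
(glyph missing) | 1448 | 7.0%
(glyph missing) | 1435 | 6.9%
(glyph missing) | 909 | 4.4%
(glyph missing) | 799 | 3.9%
(glyph missing) | 778 | 3.8%
(glyph missing) | 722 | 3.5%
(glyph missing) | 680 | 3.3%
Other values (391) | 9254 | 44.7%

None
Value | Count | Frequency (%)
(glyph missing) | 1 | 100.0%

Number Forms
Value | Count | Frequency (%)
(glyph missing) | 1 | 100.0%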

Correlations

Variable | 구역 | 배송요일
구역 | 1.000 | 1.000
배송요일 | 1.000 | 1.000

Variable | 구역 | 배송요일
구역 | 1.000 | 0.986
배송요일 | 0.986 | 1.000

Variable | 구역 | 배송요일
구역 | 1.000 | 0.986
배송요일 | 0.986 | 1.000
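
The report does not name the coefficient behind these matrices. For two categorical variables a common choice is Cramér's V, and the sketch below assumes that measure, without any bias correction, so its output need not match the reported figures exactly. `cramers_v` and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row.items():
        for y, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # a constant variable carries no association
    return sqrt(chi2 / (n * k))


# Toy data: each zone maps to exactly one delivery day.
zones = ["A", "A", "B", "B", "C", "C"]
days = ["mon", "mon", "tue", "tue", "mon", "mon"]
v_dependent = cramers_v(zones, days)

# Toy data: zone and day are independent of each other.
v_independent = cramers_v(["A", "A", "B", "B"], ["mon", "tue", "mon", "tue"])
print(v_dependent, v_independent)
```

A value near 1 is what one would expect when the delivery day is almost fully determined by the zone, which is consistent with the High correlation alert for 구역 and 배송요일.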

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

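First rows

Index | 상호명 | 구역 | 배송요일 | 도로명주소
0 | GS25 온누리점 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 광풍로905(광덕부동산)
1 | 광덕공판장 | 광덕 | (glyph missing) | 천안시 동남구 신흥리4길 49
2 | 광덕상회 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 행정길 1
3 | 농원가든 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 해사동길 15 (광덕사관광농원가든)
4 | 농협 하나로마트 광덕지점 | 광덕 | (glyph missing) | 천안시 동남구 신흥리3길 25
5 | 대성상회 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 차령고개로 1055(행정리)
6 | 매당고을 | 광덕 | (glyph missing) | 충청남도 천안시 동남구 광덕면 광풍로 765 (매당고을식당가든)
7 | 세븐일레븐 로드점 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 광풍로 729
8 | 송암슈퍼 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 해수길 96
9 | 쉼터마트 | 광덕 | (glyph missing) | 천안시 동남구 광덕면 광풍로 250 101

Last rows

Index | 상호명 | 구역 | 배송요일 | 도로명주소
1388 | 세븐일레븐 천안미죽공단점 | 풍세 | (glyph missing) | 충청남도 천안시 동남구 풍세면 미죽3길1(1층 마을회관)
1389 | 세븐일레븐 천안풍세점 | 풍세 | (glyph missing) | 충남 천안시 풍세면 남관리87번지
1390 | 씨유 남관점 | 풍세 | (glyph missing) | 천안시 동남구 풍세로466
1391 | 씨유 센토피아점 | 풍세 | (glyph missing) | 천안시 동남구 풍세면 풍세산단로 4
1392 | 씨유 천안풍세수자인 | 풍세 | (glyph missing) | 충청남도 천안시 동남구 풍세면 풍세산단로 287 1층 제108호
1393 | 이마트24 R산단점 | 풍세 | (glyph missing) | 천안시 동남구 풍세면 풍세산단4로 81
1394 | 이마트24 천안풍세중앙점 | 풍세 | (glyph missing) | 충청남도 천안시 동남구 풍세면 풍세산단로 18-12
1395 | 이마트24 풍세점 | 풍세 | (glyph missing) | 천안시 동남구 풍세면 풍세산단1로 54
1396 | 풍세공판장 | 풍세 | (glyph missing) | 천안시 동남구 풍세면 풍세로 87
1397 | 훼미리마트 풍세광국점 | 풍세 | (glyph missing) | 충청남도 천안시 동남구 풍세면 풍서리 448-2 1층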