Overview

Dataset statistics

Number of variables4
Number of observations636
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.6 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description광주광역시 서구 관내 종량제봉투판매업소에 대한 판매소명, 전화번호, 사업장 주소(도로명주소) 등에 관한 정보를 제공합니다.
Author광주광역시 서구
URLhttps://www.data.go.kr/data/15035588/fileData.do

Alerts

전화번호 is highly imbalanced (86.7%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:13:02.410012
Analysis finished2024-04-06 08:13:03.718860
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct636
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean318.5
Minimum1
Maximum636
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2024-04-06T17:13:03.865060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile32.75
Q1159.75
median318.5
Q3477.25
95-th percentile604.25
Maximum636
Range635
Interquartile range (IQR)317.5

Descriptive statistics

Standard deviation183.74167
Coefficient of variation (CV)0.57689691
Kurtosis-1.2
Mean318.5
Median Absolute Deviation (MAD)159
Skewness0
Sum202566
Variance33761
MonotonicityStrictly increasing
2024-04-06T17:13:04.206004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
429 1
 
0.2%
422 1
 
0.2%
423 1
 
0.2%
424 1
 
0.2%
425 1
 
0.2%
426 1
 
0.2%
427 1
 
0.2%
428 1
 
0.2%
430 1
 
0.2%
Other values (626) 626
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
636 1
0.2%
635 1
0.2%
634 1
0.2%
633 1
0.2%
632 1
0.2%
631 1
0.2%
630 1
0.2%
629 1
0.2%
628 1
0.2%
627 1
0.2%
Distinct621
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2024-04-06T17:13:04.786819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.9559748
Min length2

Characters and Unicode

Total characters5696
Distinct characters367
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique608 ?
Unique (%)95.6%

Sample

1st row지에스25상무마륵점
2nd row데일리365마켓 동천점
3rd row씨유 쌍촌광명점
4th row씨유 광주동천마을점
5th row다운다운마트
ValueCountFrequency (%)
씨유 76
 
7.7%
세븐일레븐 72
 
7.3%
이마트24 38
 
3.8%
gs25 36
 
3.6%
지에스25 21
 
2.1%
주식회사 17
 
1.7%
지에스(gs)25 12
 
1.2%
유한회사 8
 
0.8%
초록마을 7
 
0.7%
광주풍암점 6
 
0.6%
Other values (621) 698
70.4%
2024-04-06T17:13:05.634231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
398
 
7.0%
360
 
6.3%
224
 
3.9%
198
 
3.5%
195
 
3.4%
186
 
3.3%
158
 
2.8%
134
 
2.4%
2 131
 
2.3%
114
 
2.0%
Other values (357) 3598
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4719
82.8%
Space Separator 360
 
6.3%
Decimal Number 279
 
4.9%
Uppercase Letter 177
 
3.1%
Close Punctuation 76
 
1.3%
Open Punctuation 71
 
1.2%
Lowercase Letter 9
 
0.2%
Other Punctuation 3
 
0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
398
 
8.4%
224
 
4.7%
198
 
4.2%
195
 
4.1%
186
 
3.9%
158
 
3.3%
134
 
2.8%
114
 
2.4%
112
 
2.4%
106
 
2.2%
Other values (318) 2894
61.3%
Uppercase Letter
ValueCountFrequency (%)
S 64
36.2%
G 57
32.2%
C 18
 
10.2%
U 9
 
5.1%
D 6
 
3.4%
B 3
 
1.7%
M 3
 
1.7%
K 2
 
1.1%
A 2
 
1.1%
F 2
 
1.1%
Other values (8) 11
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
e 2
22.2%
t 1
11.1%
i 1
11.1%
v 1
11.1%
a 1
11.1%
f 1
11.1%
l 1
11.1%
s 1
11.1%
Decimal Number
ValueCountFrequency (%)
2 131
47.0%
5 83
29.7%
4 43
 
15.4%
3 6
 
2.2%
0 6
 
2.2%
6 5
 
1.8%
1 5
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
360
100.0%
Close Punctuation
ValueCountFrequency (%)
) 76
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4721
82.9%
Common 789
 
13.9%
Latin 186
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
398
 
8.4%
224
 
4.7%
198
 
4.2%
195
 
4.1%
186
 
3.9%
158
 
3.3%
134
 
2.8%
114
 
2.4%
112
 
2.4%
106
 
2.2%
Other values (319) 2896
61.3%
Latin
ValueCountFrequency (%)
S 64
34.4%
G 57
30.6%
C 18
 
9.7%
U 9
 
4.8%
D 6
 
3.2%
B 3
 
1.6%
M 3
 
1.6%
K 2
 
1.1%
A 2
 
1.1%
F 2
 
1.1%
Other values (16) 20
 
10.8%
Common
ValueCountFrequency (%)
360
45.6%
2 131
 
16.6%
5 83
 
10.5%
) 76
 
9.6%
( 71
 
9.0%
4 43
 
5.4%
3 6
 
0.8%
0 6
 
0.8%
6 5
 
0.6%
1 5
 
0.6%
Other values (2) 3
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4719
82.8%
ASCII 975
 
17.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
398
 
8.4%
224
 
4.7%
198
 
4.2%
195
 
4.1%
186
 
3.9%
158
 
3.3%
134
 
2.8%
114
 
2.4%
112
 
2.4%
106
 
2.2%
Other values (318) 2894
61.3%
ASCII
ValueCountFrequency (%)
360
36.9%
2 131
 
13.4%
5 83
 
8.5%
) 76
 
7.8%
( 71
 
7.3%
S 64
 
6.6%
G 57
 
5.8%
4 43
 
4.4%
C 18
 
1.8%
U 9
 
0.9%
Other values (28) 63
 
6.5%
None
ValueCountFrequency (%)
2
100.0%

전화번호
Categorical

IMBALANCE 

Distinct44
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
데이터미수집
593 
062-522-8292
 
1
062-382-5868
 
1
062-365-6990
 
1
062-351-3938
 
1
Other values (39)
 
39

Length

Max length13
Median length6
Mean length6.4072327
Min length6

Unique

Unique43 ?
Unique (%)6.8%

Sample

1st row데이터미수집
2nd row062-522-8292
3rd row데이터미수집
4th row데이터미수집
5th row062-365-6990

Common Values

ValueCountFrequency (%)
데이터미수집 593
93.2%
062-522-8292 1
 
0.2%
062-382-5868 1
 
0.2%
062-365-6990 1
 
0.2%
062-351-3938 1
 
0.2%
062-512-1133 1
 
0.2%
048-5150-2256 1
 
0.2%
062-383-6205 1
 
0.2%
062-682-6250 1
 
0.2%
062-371-8838 1
 
0.2%
Other values (34) 34
 
5.3%

Length

2024-04-06T17:13:05.938459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
데이터미수집 593
93.2%
062-522-8292 1
 
0.2%
062-366-6348 1
 
0.2%
062-381-0711 1
 
0.2%
062-226-1662 1
 
0.2%
062-373-3407 1
 
0.2%
062-372-3992 1
 
0.2%
062-655-6255 1
 
0.2%
062-385-1703 1
 
0.2%
062-376-0712 1
 
0.2%
Other values (34) 34
 
5.3%
Distinct606
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2024-04-06T17:13:06.537338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length38
Mean length22.803459
Min length6

Characters and Unicode

Total characters14503
Distinct characters210
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique580 ?
Unique (%)91.2%

Sample

1st row광주광역시 서구 신기길 1, 101호(마륵동)
2nd row광주광역시 서구 하남대로710번길 6 1층, 데일리365마켓 동천점
3rd row광주광역시 서구 상일로54번길 8
4th row광주광역시 서구 동천로18번길 19, 제분산상가동 101호, 102호
5th row광주광역시 남구 월산로 147-1
ValueCountFrequency (%)
광주광역시 630
22.0%
서구 593
20.7%
1층 54
 
1.9%
상가동 19
 
0.7%
상무대로 17
 
0.6%
8 16
 
0.6%
시청로 16
 
0.6%
치평로 16
 
0.6%
화정로 16
 
0.6%
북구 16
 
0.6%
Other values (748) 1466
51.3%
2024-04-06T17:13:07.523524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2566
17.7%
1285
 
8.9%
1 776
 
5.4%
669
 
4.6%
653
 
4.5%
636
 
4.4%
636
 
4.4%
613
 
4.2%
601
 
4.1%
2 336
 
2.3%
Other values (200) 5732
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8578
59.1%
Decimal Number 2641
 
18.2%
Space Separator 2566
 
17.7%
Open Punctuation 212
 
1.5%
Close Punctuation 212
 
1.5%
Other Punctuation 206
 
1.4%
Dash Punctuation 76
 
0.5%
Uppercase Letter 11
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1285
15.0%
669
 
7.8%
653
 
7.6%
636
 
7.4%
636
 
7.4%
613
 
7.1%
601
 
7.0%
290
 
3.4%
267
 
3.1%
255
 
3.0%
Other values (176) 2673
31.2%
Decimal Number
ValueCountFrequency (%)
1 776
29.4%
2 336
12.7%
0 256
 
9.7%
4 235
 
8.9%
3 206
 
7.8%
7 173
 
6.6%
8 172
 
6.5%
5 167
 
6.3%
9 165
 
6.2%
6 155
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
B 3
27.3%
A 1
 
9.1%
P 1
 
9.1%
T 1
 
9.1%
N 1
 
9.1%
E 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 205
99.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
2566
100.0%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 212
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8578
59.1%
Common 5913
40.8%
Latin 12
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1285
15.0%
669
 
7.8%
653
 
7.6%
636
 
7.4%
636
 
7.4%
613
 
7.1%
601
 
7.0%
290
 
3.4%
267
 
3.1%
255
 
3.0%
Other values (176) 2673
31.2%
Common
ValueCountFrequency (%)
2566
43.4%
1 776
 
13.1%
2 336
 
5.7%
0 256
 
4.3%
4 235
 
4.0%
( 212
 
3.6%
) 212
 
3.6%
3 206
 
3.5%
, 205
 
3.5%
7 173
 
2.9%
Other values (6) 736
 
12.4%
Latin
ValueCountFrequency (%)
C 3
25.0%
B 3
25.0%
A 1
 
8.3%
P 1
 
8.3%
T 1
 
8.3%
N 1
 
8.3%
e 1
 
8.3%
E 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8578
59.1%
ASCII 5925
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2566
43.3%
1 776
 
13.1%
2 336
 
5.7%
0 256
 
4.3%
4 235
 
4.0%
( 212
 
3.6%
) 212
 
3.6%
3 206
 
3.5%
, 205
 
3.5%
7 173
 
2.9%
Other values (14) 748
 
12.6%
Hangul
ValueCountFrequency (%)
1285
15.0%
669
 
7.8%
653
 
7.6%
636
 
7.4%
636
 
7.4%
613
 
7.1%
601
 
7.0%
290
 
3.4%
267
 
3.1%
255
 
3.0%
Other values (176) 2673
31.2%

Interactions

2024-04-06T17:13:03.213633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:13:07.732702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번전화번호
연번1.0000.076
전화번호0.0761.000
2024-04-06T17:13:07.927217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번전화번호
연번1.0000.021
전화번호0.0211.000

Missing values

2024-04-06T17:13:03.447180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:13:03.619313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번판매소명전화번호도로명주소
01지에스25상무마륵점데이터미수집광주광역시 서구 신기길 1, 101호(마륵동)
12데일리365마켓 동천점062-522-8292광주광역시 서구 하남대로710번길 6 1층, 데일리365마켓 동천점
23씨유 쌍촌광명점데이터미수집광주광역시 서구 상일로54번길 8
34씨유 광주동천마을점데이터미수집광주광역시 서구 동천로18번길 19, 제분산상가동 101호, 102호
45다운다운마트062-365-6990광주광역시 남구 월산로 147-1
56씨유 화정우량점데이터미수집광주광역시 서구 화정로 84
67씨유 상무희망점데이터미수집광주광역시 서구 상무중앙로34번길 12
78세븐일레븐 광주상무청연점데이터미수집광주광역시 서구 상무중앙로 64
89CU 광주광천점데이터미수집광주광역시 서구 천변좌로 56
910장봐주는언니 상무세정점데이터미수집광주광역시 서구 마륵로 132
연번판매소명전화번호도로명주소
626627햇살유통데이터미수집광주광역시 서구 금호운천길 80-1(쌍촌동)
627628행복마트데이터미수집광주광역시 서구 독립로 193(양동)
628629현대리빙(주)해피1,000데이터미수집광주광역시 서구 월드컵4강로 90(화정동)
629630현대슈퍼데이터미수집광주광역시 서구 상무대로 871
630631홈마트데이터미수집광주광역시 서구 염화로 75-1, 1층
631632화순상회데이터미수집데이터미수집
632633화정DC마트데이터미수집광주광역시 서구 상무대로 1083
633634화정마트데이터미수집광주광역시 서구 염화로45번길 13
634635힐마트데이터미수집광주광역시 서구 상무오월로52번길 3
635636(주)광주신세계데이터미수집광주광역시 서구 무진대로 932