Overview

Dataset statistics

Number of variables5
Number of observations165
Missing cells27
Missing cells (%)3.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory41.8 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description대구광역시 서구_대형폐기물 스티커 판매소_20240213
Author대구광역시 서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15055768&dataSetDetailId=150557681b39d0b6a20e5&provdMethod=FILE

Alerts

연번 is highly overall correlated with 소재지High correlation
소재지 is highly overall correlated with 연번High correlation
전화번호 has 27 (16.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-13 14:20:21.647179
Analysis finished2024-03-13 14:20:22.203371
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct165
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83
Minimum1
Maximum165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-03-13T23:20:22.283870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.2
Q142
median83
Q3124
95-th percentile156.8
Maximum165
Range164
Interquartile range (IQR)82

Descriptive statistics

Standard deviation47.775517
Coefficient of variation (CV)0.57560864
Kurtosis-1.2
Mean83
Median Absolute Deviation (MAD)41
Skewness0
Sum13695
Variance2282.5
MonotonicityStrictly increasing
2024-03-13T23:20:22.432163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
105 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
Other values (155) 155
93.9%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
Distinct151
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-13T23:20:22.719806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length7.2666667
Min length3

Characters and Unicode

Total characters1199
Distinct characters186
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)86.1%

Sample

1st row대백마트 내당점
2nd row미니마트
3rd row미카엘소비센터
4th row생큐마트비타민25
5th row세븐일레븐 대구내당이편한점
ValueCountFrequency (%)
세븐일레븐 13
 
5.4%
cu 10
 
4.1%
gs25 9
 
3.7%
대백마트 8
 
3.3%
이마트24 8
 
3.3%
홈마트 7
 
2.9%
신우유통 4
 
1.7%
평리점 4
 
1.7%
필마트 4
 
1.7%
나이스마트 4
 
1.7%
Other values (150) 171
70.7%
2024-03-13T23:20:23.181057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
112
 
9.3%
106
 
8.8%
77
 
6.4%
68
 
5.7%
58
 
4.8%
35
 
2.9%
29
 
2.4%
26
 
2.2%
23
 
1.9%
20
 
1.7%
Other values (176) 645
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1024
85.4%
Space Separator 77
 
6.4%
Decimal Number 45
 
3.8%
Uppercase Letter 45
 
3.8%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
112
 
10.9%
106
 
10.4%
68
 
6.6%
58
 
5.7%
35
 
3.4%
29
 
2.8%
26
 
2.5%
23
 
2.2%
20
 
2.0%
19
 
1.9%
Other values (158) 528
51.6%
Uppercase Letter
ValueCountFrequency (%)
U 10
22.2%
C 10
22.2%
S 10
22.2%
G 9
20.0%
K 3
 
6.7%
O 2
 
4.4%
R 1
 
2.2%
Decimal Number
ValueCountFrequency (%)
2 18
40.0%
5 10
22.2%
4 9
20.0%
0 4
 
8.9%
7 3
 
6.7%
8 1
 
2.2%
Space Separator
ValueCountFrequency (%)
77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
k 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1024
85.4%
Common 129
 
10.8%
Latin 46
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
112
 
10.9%
106
 
10.4%
68
 
6.6%
58
 
5.7%
35
 
3.4%
29
 
2.8%
26
 
2.5%
23
 
2.2%
20
 
2.0%
19
 
1.9%
Other values (158) 528
51.6%
Common
ValueCountFrequency (%)
77
59.7%
2 18
 
14.0%
5 10
 
7.8%
4 9
 
7.0%
0 4
 
3.1%
7 3
 
2.3%
( 3
 
2.3%
) 3
 
2.3%
. 1
 
0.8%
8 1
 
0.8%
Latin
ValueCountFrequency (%)
U 10
21.7%
C 10
21.7%
S 10
21.7%
G 9
19.6%
K 3
 
6.5%
O 2
 
4.3%
k 1
 
2.2%
R 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1024
85.4%
ASCII 175
 
14.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
112
 
10.9%
106
 
10.4%
68
 
6.6%
58
 
5.7%
35
 
3.4%
29
 
2.8%
26
 
2.5%
23
 
2.2%
20
 
2.0%
19
 
1.9%
Other values (158) 528
51.6%
ASCII
ValueCountFrequency (%)
77
44.0%
2 18
 
10.3%
U 10
 
5.7%
C 10
 
5.7%
S 10
 
5.7%
5 10
 
5.7%
4 9
 
5.1%
G 9
 
5.1%
0 4
 
2.3%
7 3
 
1.7%
Other values (8) 15
 
8.6%

소재지
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)10.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
상중이동
16 
평리4동
16 
내당4동
15 
원대동
13 
평리3동
12 
Other values (13)
93 

Length

Max length6
Median length4
Mean length4.1151515
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row내당1동
2nd row내당1동
3rd row내당1동
4th row내당1동
5th row내당1동

Common Values

ValueCountFrequency (%)
상중이동 16
 
9.7%
평리4동 16
 
9.7%
내당4동 15
 
9.1%
원대동 13
 
7.9%
평리3동 12
 
7.3%
비산7동 12
 
7.3%
평리1동 10
 
6.1%
비산1동 10
 
6.1%
비산2.3동 9
 
5.5%
내당2.3동 8
 
4.8%
Other values (8) 44
26.7%

Length

2024-03-13T23:20:23.337467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
상중이동 16
 
9.7%
평리4동 16
 
9.7%
내당4동 15
 
9.1%
원대동 13
 
7.9%
평리3동 12
 
7.3%
비산7동 12
 
7.3%
평리1동 10
 
6.1%
비산1동 10
 
6.1%
비산2.3동 9
 
5.5%
내당2.3동 8
 
4.8%
Other values (8) 44
26.7%
Distinct164
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-13T23:20:23.622485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length19
Mean length11.084848
Min length6

Characters and Unicode

Total characters1829
Distinct characters65
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique163 ?
Unique (%)98.8%

Sample

1st row통학로 54
2nd row평리로 358-1
3rd row서대구로 6길 33
4th row달구벌대로 361길 7
5th row통학로 7길 29
ValueCountFrequency (%)
국채보상로 31
 
6.6%
서대구로 19
 
4.1%
통학로 19
 
4.1%
달서로 17
 
3.6%
문화로 11
 
2.4%
북비산로 11
 
2.4%
달서천로 10
 
2.1%
달구벌대로 9
 
1.9%
고성로 8
 
1.7%
평리로 8
 
1.7%
Other values (207) 324
69.4%
2024-03-13T23:20:24.065578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
302
16.5%
164
 
9.0%
122
 
6.7%
1 115
 
6.3%
3 97
 
5.3%
2 82
 
4.5%
4 63
 
3.4%
7 58
 
3.2%
5 57
 
3.1%
6 52
 
2.8%
Other values (55) 717
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 824
45.1%
Decimal Number 634
34.7%
Space Separator 302
 
16.5%
Dash Punctuation 26
 
1.4%
Close Punctuation 16
 
0.9%
Open Punctuation 16
 
0.9%
Other Punctuation 10
 
0.5%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
19.9%
122
14.8%
48
 
5.8%
39
 
4.7%
38
 
4.6%
32
 
3.9%
31
 
3.8%
31
 
3.8%
31
 
3.8%
29
 
3.5%
Other values (39) 259
31.4%
Decimal Number
ValueCountFrequency (%)
1 115
18.1%
3 97
15.3%
2 82
12.9%
4 63
9.9%
7 58
9.1%
5 57
9.0%
6 52
8.2%
0 42
 
6.6%
8 35
 
5.5%
9 33
 
5.2%
Space Separator
ValueCountFrequency (%)
302
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1004
54.9%
Hangul 824
45.1%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
19.9%
122
14.8%
48
 
5.8%
39
 
4.7%
38
 
4.6%
32
 
3.9%
31
 
3.8%
31
 
3.8%
31
 
3.8%
29
 
3.5%
Other values (39) 259
31.4%
Common
ValueCountFrequency (%)
302
30.1%
1 115
 
11.5%
3 97
 
9.7%
2 82
 
8.2%
4 63
 
6.3%
7 58
 
5.8%
5 57
 
5.7%
6 52
 
5.2%
0 42
 
4.2%
8 35
 
3.5%
Other values (5) 101
 
10.1%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1005
54.9%
Hangul 824
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
302
30.0%
1 115
 
11.4%
3 97
 
9.7%
2 82
 
8.2%
4 63
 
6.3%
7 58
 
5.8%
5 57
 
5.7%
6 52
 
5.2%
0 42
 
4.2%
8 35
 
3.5%
Other values (6) 102
 
10.1%
Hangul
ValueCountFrequency (%)
164
19.9%
122
14.8%
48
 
5.8%
39
 
4.7%
38
 
4.6%
32
 
3.9%
31
 
3.8%
31
 
3.8%
31
 
3.8%
29
 
3.5%
Other values (39) 259
31.4%

전화번호
Text

MISSING 

Distinct138
Distinct (%)100.0%
Missing27
Missing (%)16.4%
Memory size1.4 KiB
2024-03-13T23:20:24.407618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1656
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)100.0%

Sample

1st row053-523-0010
2nd row053-555-1555
3rd row053-551-7375
4th row053-525-8525
5th row053-553-0746
ValueCountFrequency (%)
053-553-1638 1
 
0.7%
053-525-6335 1
 
0.7%
053-567-0786 1
 
0.7%
053-558-9562 1
 
0.7%
053-561-8486 1
 
0.7%
053-553-8585 1
 
0.7%
053-553-9935 1
 
0.7%
053-562-2256 1
 
0.7%
053-553-1921 1
 
0.7%
053-554-1114 1
 
0.7%
Other values (128) 128
92.8%
2024-03-13T23:20:25.015395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 378
22.8%
- 276
16.7%
3 234
14.1%
0 200
12.1%
2 136
 
8.2%
6 114
 
6.9%
7 80
 
4.8%
1 69
 
4.2%
8 61
 
3.7%
9 61
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1380
83.3%
Dash Punctuation 276
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 378
27.4%
3 234
17.0%
0 200
14.5%
2 136
 
9.9%
6 114
 
8.3%
7 80
 
5.8%
1 69
 
5.0%
8 61
 
4.4%
9 61
 
4.4%
4 47
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 276
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1656
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 378
22.8%
- 276
16.7%
3 234
14.1%
0 200
12.1%
2 136
 
8.2%
6 114
 
6.9%
7 80
 
4.8%
1 69
 
4.2%
8 61
 
3.7%
9 61
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1656
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 378
22.8%
- 276
16.7%
3 234
14.1%
0 200
12.1%
2 136
 
8.2%
6 114
 
6.9%
7 80
 
4.8%
1 69
 
4.2%
8 61
 
3.7%
9 61
 
3.7%

Interactions

2024-03-13T23:20:21.889509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T23:20:25.098347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지
연번1.0000.975
소재지0.9751.000
2024-03-13T23:20:25.170063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지
연번1.0000.847
소재지0.8471.000

Missing values

2024-03-13T23:20:22.044741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T23:20:22.156554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번판매지정업소소재지도로명주소전화번호
01대백마트 내당점내당1동통학로 54053-523-0010
12미니마트내당1동평리로 358-1053-555-1555
23미카엘소비센터내당1동서대구로 6길 33053-551-7375
34생큐마트비타민25내당1동달구벌대로 361길 7053-525-8525
45세븐일레븐 대구내당이편한점내당1동통학로 7길 29053-553-0746
56알뜰슈퍼내당1동서대구로 10길 52053-557-0652
67파워마트내당1동서대구로 8길 63053-571-1222
78GS25 내당달서로점내당2.3동달서로 12길 48053-566-4959
89GS25 대구새길점내당2.3동달서로 4길 51<NA>
910세븐일레븐 서문시장점내당2.3동큰장로 97-2<NA>
연번판매지정업소소재지도로명주소전화번호
155156세븐일레븐 대구북구청역원대동원대로 13길 2, 103호053-356-3626
156157세븐일레븐 서대구센트럴자이점원대동고성로 33, 상가 405동 106호<NA>
157158신우유통 원대점원대동고성로 15길 37053-351-1932
158159웰마트원대동달서천로 83길 27053-356-9330
159160유진유통원대동달서천로 425053-266-4560
160161필마트원대동원대로 13길 53053-719-4080
161162현대마트원대동달서천로 74길 2-1053-351-5765
162163화창할인마트원대동고성로 91053-353-9036
163164나이스마트 대신점대신동달성공원로 2053-292-5999
164165오케이포인트마트 대신점대신동달구벌대로 389길 56053-254-8003