Overview

Dataset statistics

Number of variables5
Number of observations122
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory42.1 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description인천광역시 동구 관내에 위치한 쓰레기 종량제 봉투 판매소 데이터로, 판매소명, 주소, 동명, 영업상태 등 항목을 게시하였습니다.
Author인천광역시 동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15006277&srcSe=7661IVAWM27C61E190

Alerts

영업상태 has constant value ""Constant
번호 is highly overall correlated with 동명High correlation
동명 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique

Reproduction

Analysis started2024-01-28 12:45:57.232534
Analysis finished2024-01-28 12:45:57.688481
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct122
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61.5
Minimum1
Maximum122
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-01-28T21:45:57.964702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.05
Q131.25
median61.5
Q391.75
95-th percentile115.95
Maximum122
Range121
Interquartile range (IQR)60.5

Descriptive statistics

Standard deviation35.362409
Coefficient of variation (CV)0.57499853
Kurtosis-1.2
Mean61.5
Median Absolute Deviation (MAD)30.5
Skewness0
Sum7503
Variance1250.5
MonotonicityStrictly increasing
2024-01-28T21:45:58.072170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
93 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
85 1
 
0.8%
84 1
 
0.8%
Other values (112) 112
91.8%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
122 1
0.8%
121 1
0.8%
120 1
0.8%
119 1
0.8%
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%
114 1
0.8%
113 1
0.8%
Distinct121
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-28T21:45:58.328639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length6.9508197
Min length3

Characters and Unicode

Total characters848
Distinct characters193
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique120 ?
Unique (%)98.4%

Sample

1st row유성상회
2nd row선진슈퍼
3rd row만석슈퍼
4th row주공할인마트
5th row엄마손마트
ValueCountFrequency (%)
씨유 10
 
5.8%
gs25 8
 
4.7%
세븐일레븐 8
 
4.7%
cu 4
 
2.3%
지에스25 4
 
2.3%
마트 3
 
1.7%
평화슈퍼 2
 
1.2%
이마트24 2
 
1.2%
송현하늘점 2
 
1.2%
송림점 2
 
1.2%
Other values (127) 127
73.8%
2024-01-28T21:45:58.672251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
 
5.9%
41
 
4.8%
38
 
4.5%
35
 
4.1%
24
 
2.8%
21
 
2.5%
19
 
2.2%
19
 
2.2%
18
 
2.1%
2 18
 
2.1%
Other values (183) 565
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 712
84.0%
Space Separator 50
 
5.9%
Decimal Number 40
 
4.7%
Uppercase Letter 34
 
4.0%
Open Punctuation 4
 
0.5%
Close Punctuation 4
 
0.5%
Lowercase Letter 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
5.8%
38
 
5.3%
35
 
4.9%
24
 
3.4%
21
 
2.9%
19
 
2.7%
19
 
2.7%
18
 
2.5%
18
 
2.5%
17
 
2.4%
Other values (167) 462
64.9%
Uppercase Letter
ValueCountFrequency (%)
S 11
32.4%
G 10
29.4%
C 5
14.7%
U 4
 
11.8%
T 2
 
5.9%
D 2
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 18
45.0%
5 14
35.0%
4 4
 
10.0%
0 3
 
7.5%
1 1
 
2.5%
Lowercase Letter
ValueCountFrequency (%)
e 2
50.0%
h 2
50.0%
Space Separator
ValueCountFrequency (%)
50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 712
84.0%
Common 98
 
11.6%
Latin 38
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
5.8%
38
 
5.3%
35
 
4.9%
24
 
3.4%
21
 
2.9%
19
 
2.7%
19
 
2.7%
18
 
2.5%
18
 
2.5%
17
 
2.4%
Other values (167) 462
64.9%
Common
ValueCountFrequency (%)
50
51.0%
2 18
 
18.4%
5 14
 
14.3%
( 4
 
4.1%
) 4
 
4.1%
4 4
 
4.1%
0 3
 
3.1%
1 1
 
1.0%
Latin
ValueCountFrequency (%)
S 11
28.9%
G 10
26.3%
C 5
13.2%
U 4
 
10.5%
e 2
 
5.3%
h 2
 
5.3%
T 2
 
5.3%
D 2
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 712
84.0%
ASCII 136
 
16.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
36.8%
2 18
 
13.2%
5 14
 
10.3%
S 11
 
8.1%
G 10
 
7.4%
C 5
 
3.7%
( 4
 
2.9%
) 4
 
2.9%
U 4
 
2.9%
4 4
 
2.9%
Other values (6) 12
 
8.8%
Hangul
ValueCountFrequency (%)
41
 
5.8%
38
 
5.3%
35
 
4.9%
24
 
3.4%
21
 
2.9%
19
 
2.7%
19
 
2.7%
18
 
2.5%
18
 
2.5%
17
 
2.4%
Other values (167) 462
64.9%

주소
Text

Distinct121
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-28T21:45:58.855186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length38
Mean length27.196721
Min length17

Characters and Unicode

Total characters3318
Distinct characters130
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique120 ?
Unique (%)98.4%

Sample

1st row인천광역시 동구 제물량로 341번길 13(만석동)
2nd row인천광역시 동구 만석부두로 2(만석동)
3rd row인천광역시 동구 만석부두로 1(만석동)
4th row인천광역시 동구 화도진로 187 만석비치APT상가 A동 102호(만석동)
5th row인천광역시 동구 화도진로 187 만석비치APT상가 B동 103호(만석동)
ValueCountFrequency (%)
인천광역시 122
21.5%
동구 122
21.5%
송림6동 10
 
1.8%
송현로 9
 
1.6%
샛골로 8
 
1.4%
화도진로 8
 
1.4%
화수로 6
 
1.1%
금곡로 5
 
0.9%
방축로 4
 
0.7%
솔빛로 4
 
0.7%
Other values (227) 270
47.5%
2024-01-28T21:45:59.151313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
446
 
13.4%
256
 
7.7%
1 160
 
4.8%
131
 
3.9%
128
 
3.9%
124
 
3.7%
123
 
3.7%
122
 
3.7%
122
 
3.7%
122
 
3.7%
Other values (120) 1584
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1945
58.6%
Decimal Number 570
 
17.2%
Space Separator 446
 
13.4%
Open Punctuation 118
 
3.6%
Close Punctuation 118
 
3.6%
Other Punctuation 75
 
2.3%
Uppercase Letter 23
 
0.7%
Dash Punctuation 21
 
0.6%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
256
 
13.2%
131
 
6.7%
128
 
6.6%
124
 
6.4%
123
 
6.3%
122
 
6.3%
122
 
6.3%
122
 
6.3%
112
 
5.8%
63
 
3.2%
Other values (97) 642
33.0%
Decimal Number
ValueCountFrequency (%)
1 160
28.1%
2 87
15.3%
3 80
14.0%
0 50
 
8.8%
4 38
 
6.7%
6 37
 
6.5%
7 35
 
6.1%
5 32
 
5.6%
8 28
 
4.9%
9 23
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
A 7
30.4%
B 6
26.1%
P 4
17.4%
T 4
17.4%
C 1
 
4.3%
E 1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 56
74.7%
. 19
 
25.3%
Space Separator
ValueCountFrequency (%)
446
100.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1945
58.6%
Common 1350
40.7%
Latin 23
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
256
 
13.2%
131
 
6.7%
128
 
6.6%
124
 
6.4%
123
 
6.3%
122
 
6.3%
122
 
6.3%
122
 
6.3%
112
 
5.8%
63
 
3.2%
Other values (97) 642
33.0%
Common
ValueCountFrequency (%)
446
33.0%
1 160
 
11.9%
( 118
 
8.7%
) 118
 
8.7%
2 87
 
6.4%
3 80
 
5.9%
, 56
 
4.1%
0 50
 
3.7%
4 38
 
2.8%
6 37
 
2.7%
Other values (7) 160
 
11.9%
Latin
ValueCountFrequency (%)
A 7
30.4%
B 6
26.1%
P 4
17.4%
T 4
17.4%
C 1
 
4.3%
E 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1945
58.6%
ASCII 1373
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
446
32.5%
1 160
 
11.7%
( 118
 
8.6%
) 118
 
8.6%
2 87
 
6.3%
3 80
 
5.8%
, 56
 
4.1%
0 50
 
3.6%
4 38
 
2.8%
6 37
 
2.7%
Other values (13) 183
13.3%
Hangul
ValueCountFrequency (%)
256
 
13.2%
131
 
6.7%
128
 
6.6%
124
 
6.4%
123
 
6.3%
122
 
6.3%
122
 
6.3%
122
 
6.3%
112
 
5.8%
63
 
3.2%
Other values (97) 642
33.0%

동명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
송현1.2동
18 
송림4동
16 
송림6동
16 
화수2동
12 
화수1.화평동
11 
Other values (6)
49 

Length

Max length7
Median length4
Mean length4.5983607
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row만석동
2nd row만석동
3rd row만석동
4th row만석동
5th row만석동

Common Values

ValueCountFrequency (%)
송현1.2동 18
14.8%
송림4동 16
13.1%
송림6동 16
13.1%
화수2동 12
9.8%
화수1.화평동 11
9.0%
송림2동 11
9.0%
송림3.5동 11
9.0%
만석동 10
8.2%
금창동 8
6.6%
송현3동 6
 
4.9%

Length

2024-01-28T21:45:59.263856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
송현1.2동 18
14.8%
송림4동 16
13.1%
송림6동 16
13.1%
화수2동 12
9.8%
화수1.화평동 11
9.0%
송림2동 11
9.0%
송림3.5동 11
9.0%
만석동 10
8.2%
금창동 8
6.6%
송현3동 6
 
4.9%

영업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
영업중
122 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 122
100.0%

Length

2024-01-28T21:45:59.359675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T21:45:59.431107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 122
100.0%

Interactions

2024-01-28T21:45:57.455078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T21:45:59.475466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호동명
번호1.0000.946
동명0.9461.000
2024-01-28T21:45:59.544649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호동명
번호1.0000.791
동명0.7911.000

Missing values

2024-01-28T21:45:57.576912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T21:45:57.659748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호판매소명주소동명영업상태
01유성상회인천광역시 동구 제물량로 341번길 13(만석동)만석동영업중
12선진슈퍼인천광역시 동구 만석부두로 2(만석동)만석동영업중
23만석슈퍼인천광역시 동구 만석부두로 1(만석동)만석동영업중
34주공할인마트인천광역시 동구 화도진로 187 만석비치APT상가 A동 102호(만석동)만석동영업중
45엄마손마트인천광역시 동구 화도진로 187 만석비치APT상가 B동 103호(만석동)만석동영업중
56현상회인천광역시 동구 석수로 1(만석동)만석동영업중
67강경농산물인천광역시 동구 화도진로 178(만석동)만석동영업중
78큐브24 만석인천광역시 동구 보세로 21(만석동), 1층만석동영업중
89CU 만석웰카운티점인천광역시 동구 화도진로 142 만석웰카운티APT상가 1층04호(만석동)만석동영업중
910CU 만석부두점인천광역시 동구 보세로 19(만석동)만석동영업중
번호판매소명주소동명영업상태
112113GS25 송림아이원점인천광역시 동구 송미로24번길17-8(송림동)송림6동영업중
113114현대시장상인회인천광역시 동구 샛골로 162 (송림6동)송림6동영업중
114115근면슈퍼인천광역시 동구 우각로15번길 35(금곡동)금창동영업중
115116현대슈퍼인천광역시 동구 금송로7번길 13(금곡동)금창동영업중
116117올포원인천광역시 동구 우각로 44(창영동)금창동영업중
117118오렌지마트인천광역시 동구 송림로 24, 110호(금곡동, 두손피카디리)금창동영업중
118119세븐일레븐 인천금곡점인천광역시 동구 금곡로 42(금곡동)금창동영업중
119120GS25 동구금곡점인천광역시 동구 금곡로 47(금곡동)금창동영업중
120121GS25 인천세무서점인천광역시 동구 샛골로 85(창영동)금창동영업중
121122(주)그린비지니스인천광역시 동구 금곡로 15-1, 2층(창영동)금창동영업중