Overview

Dataset statistics

Number of variables: 5
Number of observations: 500
Missing cells: 163
Missing cells (%): 6.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 20.1 KiB
Average record size in memory: 41.3 B
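
The missing-cell percentage follows from the counts above: 163 missing cells out of 5 × 500 = 2,500 cells. A minimal Python check is sketched below; `missing_cell_pct` is a hypothetical helper name, not part of the profiling tool.

```python
def missing_cell_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Return the share of missing cells as a percentage of all cells."""
    total_cells = n_rows * n_cols
    return 100.0 * n_missing / total_cells


# Counts taken from the dataset statistics above.
pct = missing_cell_pct(n_missing=163, n_rows=500, n_cols=5)
print(f"{pct:.1f}%")  # prints 6.5%
```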

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

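Description: Status information on volume-rate (pay-as-you-throw) garbage bag retailers in Jung-gu, Ulsan, managed by the Ulsan Jung-gu Urban Management Corporation (울산중구도시관리공단).
Author: 울산광역시중구도시관리공단
URL: https://www.data.go.kr/data/15005932/fileData.do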

Alerts

Imbalance: 데이터기준일자 is highly imbalanced (97.9%)
Missing: 연락처 has 163 (32.6%) missing values
Unique: 관리번호 has unique values
Unique: 업체명 has unique values
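
The 97.9% imbalance figure for 데이터기준일자 is consistent with a score of one minus the normalized Shannon entropy of the category counts (499 and 1, as listed under that variable). That formula is an assumption about how the alert is computed, not something this report states; the sketch below only shows that it reproduces the reported number.

```python
import math


def imbalance_score(counts: list[int]) -> float:
    """1 - H / log2(k): 0 for a uniform distribution, close to 1 when one class dominates.

    Assumed definition; the report itself does not give the formula.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


# Category counts for 데이터기준일자: 2020-09-14 (499 rows), 2019-10-01 (1 row).
score = imbalance_score([499, 1])
print(f"{score:.1%}")  # prints 97.9%
```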

Reproduction

Analysis started: 2023-12-12 06:30:54.451722
Analysis finished: 2023-12-12 06:30:55.191341
Duration: 0.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
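
A report of this kind can be regenerated with ydata-profiling along the following lines. This is a sketch under stated assumptions: the function name `build_report`, the file names in the usage comment, the report title, and the `cp949` encoding are all hypothetical, since the report does not record the input path, encoding, or the options that were used.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write the result as an HTML report."""
    # Imported lazily so this module still loads where the third-party
    # packages (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Profiling report")  # title is an assumption
    report.to_file(html_path)


# Hypothetical usage; adjust the path and encoding to the downloaded file:
# build_report("bag_retailers.csv", "report.html", encoding="cp949")
```

Korean public-data CSV files are often distributed in a legacy encoding, which is why the encoding is exposed as a parameter rather than hard-coded.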

Variables

관리번호
Real number (ℝ)

UNIQUE 

Distinct: 500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 250.5
Minimum: 1
Maximum: 500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.5 KiB

Quantile statistics

Minimum: 1
5th percentile: 25.95
Q1: 125.75
Median: 250.5
Q3: 375.25
95th percentile: 475.05
Maximum: 500
Range: 499
Interquartile range (IQR): 249.5

Descriptive statistics

Standard deviation: 144.48183
Coefficient of variation (CV): 0.57677378
Kurtosis: -1.2
Mean: 250.5
Median Absolute Deviation (MAD): 125
Skewness: 0
Sum: 125250
Variance: 20875
Monotonicity: Strictly increasing
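
These figures are what one would expect if 관리번호 is simply a running index from 1 to 500. The sketch below recomputes them with the Python standard library under that assumption (the report gives 500 distinct, strictly increasing values with minimum 1 and maximum 500, but does not list them all). The `quantile` helper is a hypothetical name; it uses linear interpolation between neighbouring ranks, and the variance uses the n - 1 denominator.

```python
import math
import statistics

# Assumption: the column holds each integer from 1 to 500 exactly once.
values = list(range(1, 501))


def quantile(sorted_vals, q):
    """Quantile with linear interpolation between the two closest ranks."""
    pos = q * (len(sorted_vals) - 1)
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return sorted_vals[lo] + frac * (sorted_vals[hi] - sorted_vals[lo])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
iqr = quantile(values, 0.75) - quantile(values, 0.25)

print("sum", sum(values), "mean", mean, "variance", variance)
print("std", round(std, 5), "cv", round(cv, 8))
print("p5", quantile(values, 0.05), "p95", quantile(values, 0.95))
print("iqr", iqr, "mad", mad)
```

Under this assumption the recomputed values agree with the reported sum (125250), variance (20875), IQR (249.5), and MAD (125).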
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
1 | 1 | 0.2%
331 | 1 | 0.2%
344 | 1 | 0.2%
343 | 1 | 0.2%
342 | 1 | 0.2%
341 | 1 | 0.2%
340 | 1 | 0.2%
339 | 1 | 0.2%
338 | 1 | 0.2%
337 | 1 | 0.2%
Other values (490) | 490 | 98.0%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
500 | 1 | 0.2%
499 | 1 | 0.2%
498 | 1 | 0.2%
497 | 1 | 0.2%
496 | 1 | 0.2%
495 | 1 | 0.2%
494 | 1 | 0.2%
493 | 1 | 0.2%
492 | 1 | 0.2%
491 | 1 | 0.2%

업체명
Text

UNIQUE 

Distinct: 500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 23
Median length: 18
Mean length: 8.256
Min length: 3

Characters and Unicode

Total characters: 4128
Distinct characters: 319
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 500
Unique (%): 100.0%

Sample

1st row: 지에스(GS)25 테화강정원점
2nd row: 수전철물
3rd row: 씨유 남외동천
4th row: 세븐일레븐 울산학산점
5th row: 진화기업사

Value | Count | Frequency (%)
씨유 | 18 | 2.8%
세븐일레븐 | 17 | 2.7%
gs25 | 8 | 1.3%
이마트24 | 6 | 0.9%
미니스톱 | 6 | 0.9%
주식회사 | 5 | 0.8%
현대홈마트 | 4 | 0.6%
성남점 | 4 | 0.6%
학성점 | 4 | 0.6%
cu | 4 | 0.6%
Other values (533) | 557 | 88.0%

Most occurring characters

ValueCountFrequency (%)
247
 
6.0%
178
 
4.3%
169
 
4.1%
( 154
 
3.7%
) 154
 
3.7%
133
 
3.2%
125
 
3.0%
100
 
2.4%
2 78
 
1.9%
73
 
1.8%
Other values (309) 2717
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3345
81.0%
Uppercase Letter 176
 
4.3%
Decimal Number 158
 
3.8%
Open Punctuation 154
 
3.7%
Close Punctuation 154
 
3.7%
Space Separator 133
 
3.2%
Other Punctuation 4
 
0.1%
Other Symbol 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
247
 
7.4%
178
 
5.3%
169
 
5.1%
125
 
3.7%
100
 
3.0%
73
 
2.2%
71
 
2.1%
66
 
2.0%
65
 
1.9%
62
 
1.9%
Other values (280) 2189
65.4%
Uppercase Letter
ValueCountFrequency (%)
G 40
22.7%
S 40
22.7%
C 38
21.6%
U 28
15.9%
D 4
 
2.3%
L 4
 
2.3%
N 3
 
1.7%
H 3
 
1.7%
M 3
 
1.7%
B 3
 
1.7%
Other values (6) 10
 
5.7%
Decimal Number
ValueCountFrequency (%)
2 78
49.4%
5 50
31.6%
4 18
 
11.4%
1 9
 
5.7%
0 3
 
1.9%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
& 1
 
25.0%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
g 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 154
100.0%
Close Punctuation
ValueCountFrequency (%)
) 154
100.0%
Space Separator
ValueCountFrequency (%)
133
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3347
81.1%
Common 603
 
14.6%
Latin 178
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
247
 
7.4%
178
 
5.3%
169
 
5.0%
125
 
3.7%
100
 
3.0%
73
 
2.2%
71
 
2.1%
66
 
2.0%
65
 
1.9%
62
 
1.9%
Other values (281) 2191
65.5%
Latin
ValueCountFrequency (%)
G 40
22.5%
S 40
22.5%
C 38
21.3%
U 28
15.7%
D 4
 
2.2%
L 4
 
2.2%
N 3
 
1.7%
H 3
 
1.7%
M 3
 
1.7%
B 3
 
1.7%
Other values (8) 12
 
6.7%
Common
ValueCountFrequency (%)
( 154
25.5%
) 154
25.5%
133
22.1%
2 78
12.9%
5 50
 
8.3%
4 18
 
3.0%
1 9
 
1.5%
. 3
 
0.5%
0 3
 
0.5%
& 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3345
81.0%
ASCII 781
 
18.9%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
247
 
7.4%
178
 
5.3%
169
 
5.1%
125
 
3.7%
100
 
3.0%
73
 
2.2%
71
 
2.1%
66
 
2.0%
65
 
1.9%
62
 
1.9%
Other values (280) 2189
65.4%
ASCII
ValueCountFrequency (%)
( 154
19.7%
) 154
19.7%
133
17.0%
2 78
10.0%
5 50
 
6.4%
G 40
 
5.1%
S 40
 
5.1%
C 38
 
4.9%
U 28
 
3.6%
4 18
 
2.3%
Other values (18) 48
 
6.1%
None
ValueCountFrequency (%)
2
100.0%

연락처
Text

MISSING 

Distinct: 336
Distinct (%): 99.7%
Missing: 163
Missing (%): 32.6%
Memory size: 4.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.008902
Min length: 12

Characters and Unicode

Total characters: 4047
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 335
Unique (%): 99.4%
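
The percentages for 연락처 are easier to read once the denominators are made explicit. The reported numbers are consistent with Missing (%) being taken over all 500 rows, while Distinct (%), Unique (%), and mean length are taken over the 337 non-missing values. That reading is inferred from the figures, not stated in the report; the variable names below are illustrative.

```python
# Counts copied from the 연락처 summary above.
n_rows = 500
n_missing = 163
n_distinct = 336
n_unique = 335      # values that occur exactly once
total_chars = 4047

n_present = n_rows - n_missing  # 337 non-missing values

missing_pct = 100 * n_missing / n_rows       # relative to all rows
distinct_pct = 100 * n_distinct / n_present  # relative to non-missing values (assumed)
unique_pct = 100 * n_unique / n_present      # relative to non-missing values (assumed)
mean_length = total_chars / n_present

print(f"{missing_pct:.1f}% {distinct_pct:.1f}% {unique_pct:.1f}% {mean_length:.6f}")
# prints 32.6% 99.7% 99.4% 12.008902
```

With 336 distinct values among 337 entries, exactly one phone number appears twice, which matches the single value with count 2 in the frequency table.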

Sample

1st row: 052-243-2728
2nd row: 052-298-1342
3rd row: 052-282-5045
4th row: 052-294-4422
5th row: 052-265-9909

Value | Count | Frequency (%)
052-292-5549 | 2 | 0.6%
052-246-0884 | 1 | 0.3%
052-245-0129 | 1 | 0.3%
052-246-2225 | 1 | 0.3%
052-281-6330 | 1 | 0.3%
052-293-0195 | 1 | 0.3%
052-246-0681 | 1 | 0.3%
052-248-9393 | 1 | 0.3%
052-243-6771 | 1 | 0.3%
052-298-7034 | 1 | 0.3%
Other values (326) | 326 | 96.7%

Most occurring characters

ValueCountFrequency (%)
2 840
20.8%
- 674
16.7%
0 545
13.5%
5 505
12.5%
4 288
 
7.1%
9 280
 
6.9%
8 217
 
5.4%
1 188
 
4.6%
3 187
 
4.6%
7 169
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3373
83.3%
Dash Punctuation 674
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 840
24.9%
0 545
16.2%
5 505
15.0%
4 288
 
8.5%
9 280
 
8.3%
8 217
 
6.4%
1 188
 
5.6%
3 187
 
5.5%
7 169
 
5.0%
6 154
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 674
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4047
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 840
20.8%
- 674
16.7%
0 545
13.5%
5 505
12.5%
4 288
 
7.1%
9 280
 
6.9%
8 217
 
5.4%
1 188
 
4.6%
3 187
 
4.6%
7 169
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4047
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 840
20.8%
- 674
16.7%
0 545
13.5%
5 505
12.5%
4 288
 
7.1%
9 280
 
6.9%
8 217
 
5.4%
1 188
 
4.6%
3 187
 
4.6%
7 169
 
4.2%

주소
Text

Distinct: 498
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 38
Median length: 30
Mean length: 16.34
Min length: 6

Characters and Unicode

Total characters: 8170
Distinct characters: 210
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 496
Unique (%): 99.2%

Sample

1st row: 신기4길 29
2nd row: 내황 15길 23 (반구동)
3rd row: 남외1길 23
4th row: 옥교 8길 1, 1층 (학산동)
5th row: 중앙길 39 (성남동)

Value | Count | Frequency (%)
중구 | 56 | 5.7%
태화동 | 14 | 1.4%
1층 | 13 | 1.3%
남구 | 11 | 1.1%
반구동 | 10 | 1.0%
종가로 | 9 | 0.9%
1 | 9 | 0.9%
울산광역시 | 8 | 0.8%
울산 | 7 | 0.7%
성안동 | 7 | 0.7%
Other values (676) | 841 | 85.4%

Most occurring characters

ValueCountFrequency (%)
1 673
 
8.2%
491
 
6.0%
485
 
5.9%
) 438
 
5.4%
( 438
 
5.4%
2 377
 
4.6%
- 336
 
4.1%
3 308
 
3.8%
4 304
 
3.7%
297
 
3.6%
Other values (200) 4023
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3494
42.8%
Decimal Number 2887
35.3%
Space Separator 485
 
5.9%
Close Punctuation 438
 
5.4%
Open Punctuation 438
 
5.4%
Dash Punctuation 336
 
4.1%
Other Punctuation 69
 
0.8%
Uppercase Letter 23
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
491
 
14.1%
297
 
8.5%
183
 
5.2%
179
 
5.1%
121
 
3.5%
113
 
3.2%
104
 
3.0%
101
 
2.9%
83
 
2.4%
76
 
2.2%
Other values (172) 1746
50.0%
Uppercase Letter
ValueCountFrequency (%)
B 5
21.7%
L 5
21.7%
H 4
17.4%
A 2
 
8.7%
N 1
 
4.3%
M 1
 
4.3%
C 1
 
4.3%
I 1
 
4.3%
P 1
 
4.3%
R 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 673
23.3%
2 377
13.1%
3 308
10.7%
4 304
10.5%
5 265
 
9.2%
7 221
 
7.7%
0 220
 
7.6%
6 203
 
7.0%
8 170
 
5.9%
9 146
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 65
94.2%
. 2
 
2.9%
/ 2
 
2.9%
Space Separator
ValueCountFrequency (%)
485
100.0%
Close Punctuation
ValueCountFrequency (%)
) 438
100.0%
Open Punctuation
ValueCountFrequency (%)
( 438
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 336
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4653
57.0%
Hangul 3494
42.8%
Latin 23
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
491
 
14.1%
297
 
8.5%
183
 
5.2%
179
 
5.1%
121
 
3.5%
113
 
3.2%
104
 
3.0%
101
 
2.9%
83
 
2.4%
76
 
2.2%
Other values (172) 1746
50.0%
Common
ValueCountFrequency (%)
1 673
14.5%
485
10.4%
) 438
9.4%
( 438
9.4%
2 377
8.1%
- 336
7.2%
3 308
 
6.6%
4 304
 
6.5%
5 265
 
5.7%
7 221
 
4.7%
Other values (7) 808
17.4%
Latin
ValueCountFrequency (%)
B 5
21.7%
L 5
21.7%
H 4
17.4%
A 2
 
8.7%
N 1
 
4.3%
M 1
 
4.3%
C 1
 
4.3%
I 1
 
4.3%
P 1
 
4.3%
R 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4676
57.2%
Hangul 3494
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 673
14.4%
485
10.4%
) 438
9.4%
( 438
9.4%
2 377
8.1%
- 336
7.2%
3 308
 
6.6%
4 304
 
6.5%
5 265
 
5.7%
7 221
 
4.7%
Other values (18) 831
17.8%
Hangul
ValueCountFrequency (%)
491
 
14.1%
297
 
8.5%
183
 
5.2%
179
 
5.1%
121
 
3.5%
113
 
3.2%
104
 
3.0%
101
 
2.9%
83
 
2.4%
76
 
2.2%
Other values (172) 1746
50.0%

데이터기준일자
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
2020-09-14: 499
2019-10-01: 1

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 2020-09-14
2nd row: 2020-09-14
3rd row: 2020-09-14
4th row: 2020-09-14
5th row: 2020-09-14

Common Values

Value | Count | Frequency (%)
2020-09-14 | 499 | 99.8%
2019-10-01 | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2020-09-14 | 499 | 99.8%
2019-10-01 | 1 | 0.2%

Interactions


Correlations

 | 관리번호 | 데이터기준일자
관리번호 | 1.000 | 0.008
데이터기준일자 | 0.008 | 1.000

 | 관리번호 | 데이터기준일자
관리번호 | 1.000 | 0.000
데이터기준일자 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 관리번호 | 업체명 | 연락처 | 주소 | 데이터기준일자
0 | 1 | 지에스(GS)25 테화강정원점 | <NA> | 신기4길 29 | 2020-09-14
1 | 2 | 수전철물 | <NA> | 내황 15길 23 (반구동) | 2020-09-14
2 | 3 | 씨유 남외동천 | <NA> | 남외1길 23 | 2020-09-14
3 | 4 | 세븐일레븐 울산학산점 | <NA> | 옥교 8길 1, 1층 (학산동) | 2020-09-14
4 | 5 | 진화기업사 | 052-243-2728 | 중앙길 39 (성남동) | 2020-09-14
5 | 6 | 목화건재 | 052-298-1342 | 북구 진장동 61B6L | 2020-09-14
6 | 7 | 평화종합철물건재 | <NA> | 태화로 106-1(태화동) | 2020-09-14
7 | 8 | 디시마트(DC마트) | 052-282-5045 | 반구정 4길 58 1층 (반구동) | 2020-09-14
8 | 9 | 주식회사 대명창호부속 | 052-294-4422 | 해오름 2길 1 (남외동) | 2020-09-14
9 | 10 | 보라종합철물건재 | <NA> | 시원길 7 (우정동) | 2020-09-14

 | 관리번호 | 업체명 | 연락처 | 주소 | 데이터기준일자
490 | 491 | 지에스(GS)25 중구 그랜드점 | <NA> | 중구 반구정 14길 67(반구동) | 2020-09-14
491 | 492 | 새물약국 | <NA> | 중구 태화로 180, 1층 B호(태화동) | 2020-09-14
492 | 493 | 이마트24 울산우정 | <NA> | 우정7길 9 | 2020-09-14
493 | 494 | 프레쉬마켓 | <NA> | 중구 유곡로 3, 외 1필지 | 2020-09-14
494 | 495 | 씨유울산다운타운점 | <NA> | 중구 운곡길 70, 1층 | 2020-09-14
495 | 496 | 씨유 울산다운점 | <NA> | 다운로 120(다운동 761-2) | 2020-09-14
496 | 497 | 로그인 울산혁신점 | <NA> | 중구 종가로 250, 지하1층 102호(유곡동, 우정혁신동원1차) | 2020-09-14
497 | 498 | 한신스토아 | 052-297-8571 | 해오름6길23(남외동447-63) | 2020-09-14
498 | 499 | 금릉미니슈퍼 | 052-224-3070 | 다운15길7(다운동809-4) | 2020-09-14
499 | 500 | 중앙부속철물 | 052-294-5257 | 화합로416-1(반구동57-2) | 2020-09-14