Overview

Dataset statistics

Number of variables5
Number of observations471
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.5 KiB
Average record size in memory40.3 B

Variable types

Text3
Categorical2

Dataset

Description수원도시공사 내 자원순환센터에서 판매중인 종량제봉투를 구매하여 수원도시공사 측에서 배송하는 거래업체에 대한 정보 입니다
Author수원도시공사
URLhttps://www.data.go.kr/data/3084453/fileData.do

Alerts

폐업유무 has constant value ""Constant

Reproduction

Analysis started2024-01-06 12:10:53.475596
Analysis finished2024-01-06 12:10:54.565699
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct424
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2024-01-06T12:10:54.994274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length3.0721868
Min length2

Characters and Unicode

Total characters1447
Distinct characters181
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique394 ?
Unique (%)83.7%

Sample

1st row이승손
2nd row노요민
3rd row김영수
4th row곽병일
5th row이우춘
ValueCountFrequency (%)
최경호 8
 
1.7%
이제훈 5
 
1.1%
김성영 5
 
1.1%
강희석 4
 
0.8%
정재훈 3
 
0.6%
강성현 3
 
0.6%
김영태 3
 
0.6%
안성준 2
 
0.4%
이은경 2
 
0.4%
임인숙 2
 
0.4%
Other values (415) 435
92.2%
2024-01-06T12:10:56.018715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
110
 
7.6%
81
 
5.6%
57
 
3.9%
41
 
2.8%
37
 
2.6%
36
 
2.5%
32
 
2.2%
30
 
2.1%
28
 
1.9%
28
 
1.9%
Other values (171) 967
66.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1424
98.4%
Lowercase Letter 17
 
1.2%
Other Punctuation 3
 
0.2%
Decimal Number 2
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
110
 
7.7%
81
 
5.7%
57
 
4.0%
41
 
2.9%
37
 
2.6%
36
 
2.5%
32
 
2.2%
30
 
2.1%
28
 
2.0%
28
 
2.0%
Other values (159) 944
66.3%
Lowercase Letter
ValueCountFrequency (%)
u 4
23.5%
i 3
17.6%
n 2
11.8%
h 2
11.8%
y 2
11.8%
a 1
 
5.9%
c 1
 
5.9%
j 1
 
5.9%
z 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
/ 3
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1424
98.4%
Latin 17
 
1.2%
Common 6
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
110
 
7.7%
81
 
5.7%
57
 
4.0%
41
 
2.9%
37
 
2.6%
36
 
2.5%
32
 
2.2%
30
 
2.1%
28
 
2.0%
28
 
2.0%
Other values (159) 944
66.3%
Latin
ValueCountFrequency (%)
u 4
23.5%
i 3
17.6%
n 2
11.8%
h 2
11.8%
y 2
11.8%
a 1
 
5.9%
c 1
 
5.9%
j 1
 
5.9%
z 1
 
5.9%
Common
ValueCountFrequency (%)
/ 3
50.0%
1 2
33.3%
1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1424
98.4%
ASCII 23
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
110
 
7.7%
81
 
5.7%
57
 
4.0%
41
 
2.9%
37
 
2.6%
36
 
2.5%
32
 
2.2%
30
 
2.1%
28
 
2.0%
28
 
2.0%
Other values (159) 944
66.3%
ASCII
ValueCountFrequency (%)
u 4
17.4%
i 3
13.0%
/ 3
13.0%
n 2
8.7%
1 2
8.7%
h 2
8.7%
y 2
8.7%
a 1
 
4.3%
1
 
4.3%
c 1
 
4.3%
Other values (2) 2
8.7%

폐업유무
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
영업
471 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 471
100.0%

Length

2024-01-06T12:10:56.441411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:10:56.722548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 471
100.0%
Distinct453
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2024-01-06T12:10:57.198272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters5652
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique446 ?
Unique (%)94.7%

Sample

1st row829-17-01859
2nd row453-45-00879
3rd row862-01-02422
4th row240-08-01092
5th row782-12-00110
ValueCountFrequency (%)
220-81-60348 5
 
1.1%
206-86-50913 5
 
1.1%
212-81-25544 5
 
1.1%
107-86-57147 3
 
0.6%
442-82-00137 3
 
0.6%
208-81-27413 2
 
0.4%
126-86-45211 2
 
0.4%
543-68-00318 1
 
0.2%
658-19-01173 1
 
0.2%
394-12-01606 1
 
0.2%
Other values (443) 443
94.1%
2024-01-06T12:10:58.092102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 942
16.7%
0 813
14.4%
1 669
11.8%
2 589
10.4%
3 453
8.0%
5 446
7.9%
4 433
7.7%
8 383
6.8%
6 353
 
6.2%
7 294
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4710
83.3%
Dash Punctuation 942
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 813
17.3%
1 669
14.2%
2 589
12.5%
3 453
9.6%
5 446
9.5%
4 433
9.2%
8 383
8.1%
6 353
7.5%
7 294
 
6.2%
9 277
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 942
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5652
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 942
16.7%
0 813
14.4%
1 669
11.8%
2 589
10.4%
3 453
8.0%
5 446
7.9%
4 433
7.7%
8 383
6.8%
6 353
 
6.2%
7 294
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5652
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 942
16.7%
0 813
14.4%
1 669
11.8%
2 589
10.4%
3 453
8.0%
5 446
7.9%
4 433
7.7%
8 383
6.8%
6 353
 
6.2%
7 294
 
5.2%

배송요일
Categorical

Distinct5
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
107 
103 
98 
83 
80 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
107
22.7%
103
21.9%
98
20.8%
83
17.6%
80
17.0%

Length

2024-01-06T12:10:58.560377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:10:58.878916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
107
22.7%
103
21.9%
98
20.8%
83
17.6%
80
17.0%
Distinct469
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2024-01-06T12:10:59.390202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length10.528662
Min length3

Characters and Unicode

Total characters4959
Distinct characters319
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique467 ?
Unique (%)99.2%

Sample

1st rowCU(매탄중부대로점)
2nd rowCU(매탄중앙점)
3rd rowCU(매탄힐스점)
4th rowGS(매탄힐스점)
5th rowGS25(매탄중부로)
ValueCountFrequency (%)
gs25 10
 
1.9%
이마트24 8
 
1.5%
광교대학로점 2
 
0.4%
농민마트 2
 
0.4%
주)마트킹 2
 
0.4%
망포원룸점 2
 
0.4%
영통점 2
 
0.4%
이마트에브리데이 2
 
0.4%
세븐일레븐(수원신원로점 2
 
0.4%
이마트24(영통영동점 1
 
0.2%
Other values (484) 484
93.6%
2024-01-06T12:11:00.699171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 403
 
8.1%
) 403
 
8.1%
345
 
7.0%
2 148
 
3.0%
144
 
2.9%
120
 
2.4%
116
 
2.3%
S 116
 
2.3%
113
 
2.3%
U 112
 
2.3%
Other values (309) 2939
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3316
66.9%
Uppercase Letter 464
 
9.4%
Open Punctuation 403
 
8.1%
Close Punctuation 403
 
8.1%
Decimal Number 313
 
6.3%
Space Separator 46
 
0.9%
Lowercase Letter 13
 
0.3%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
345
 
10.4%
144
 
4.3%
120
 
3.6%
116
 
3.5%
113
 
3.4%
109
 
3.3%
104
 
3.1%
101
 
3.0%
100
 
3.0%
88
 
2.7%
Other values (279) 1976
59.6%
Uppercase Letter
ValueCountFrequency (%)
S 116
25.0%
U 112
24.1%
C 111
23.9%
G 110
23.7%
R 3
 
0.6%
K 3
 
0.6%
L 3
 
0.6%
H 2
 
0.4%
W 1
 
0.2%
D 1
 
0.2%
Other values (2) 2
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 148
47.3%
5 107
34.2%
4 37
 
11.8%
1 8
 
2.6%
3 4
 
1.3%
6 3
 
1.0%
7 3
 
1.0%
0 1
 
0.3%
9 1
 
0.3%
8 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
e 4
30.8%
l 3
23.1%
f 3
23.1%
s 3
23.1%
Open Punctuation
ValueCountFrequency (%)
( 403
100.0%
Close Punctuation
ValueCountFrequency (%)
) 403
100.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3316
66.9%
Common 1166
 
23.5%
Latin 477
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
345
 
10.4%
144
 
4.3%
120
 
3.6%
116
 
3.5%
113
 
3.4%
109
 
3.3%
104
 
3.1%
101
 
3.0%
100
 
3.0%
88
 
2.7%
Other values (279) 1976
59.6%
Latin
ValueCountFrequency (%)
S 116
24.3%
U 112
23.5%
C 111
23.3%
G 110
23.1%
e 4
 
0.8%
R 3
 
0.6%
l 3
 
0.6%
K 3
 
0.6%
L 3
 
0.6%
f 3
 
0.6%
Other values (6) 9
 
1.9%
Common
ValueCountFrequency (%)
( 403
34.6%
) 403
34.6%
2 148
 
12.7%
5 107
 
9.2%
46
 
3.9%
4 37
 
3.2%
1 8
 
0.7%
3 4
 
0.3%
6 3
 
0.3%
7 3
 
0.3%
Other values (4) 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3316
66.9%
ASCII 1643
33.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 403
24.5%
) 403
24.5%
2 148
 
9.0%
S 116
 
7.1%
U 112
 
6.8%
C 111
 
6.8%
G 110
 
6.7%
5 107
 
6.5%
46
 
2.8%
4 37
 
2.3%
Other values (20) 50
 
3.0%
Hangul
ValueCountFrequency (%)
345
 
10.4%
144
 
4.3%
120
 
3.6%
116
 
3.5%
113
 
3.4%
109
 
3.3%
104
 
3.1%
101
 
3.0%
100
 
3.0%
88
 
2.7%
Other values (279) 1976
59.6%

Missing values

2024-01-06T12:10:54.132530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-06T12:10:54.447083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대표자명폐업유무사업자번호배송요일거래처명
0이승손영업829-17-01859CU(매탄중부대로점)
1노요민영업453-45-00879CU(매탄중앙점)
2김영수영업862-01-02422CU(매탄힐스점)
3곽병일영업240-08-01092GS(매탄힐스점)
4이우춘영업782-12-00110GS25(매탄중부로)
5박효주영업620-32-01124GS25(영통으뜸점)
6박성복영업414-04-20673경기유통
7최연식영업503-28-63639다팜몰 (수원점)
8김동기영업135-26-56402대동종합상사
9윤문선영업530-31-00621세븐일레븐(매탄매화점)
대표자명폐업유무사업자번호배송요일거래처명
461김영태영업107-86-57147영통맞이방편의점
462김영태영업107-86-57147영통맞이방편의점(1번출구)
463이은정영업883-01-00509우리농산물마트(영통)
464손순애영업135-32-66084이마트24(영통영동점)
465박웅진영업124-32-94561이마트24(영통점)
466박근곤영업111-16-92058이마트24(영통주공5단지점)
467김보람영업538-85-02136자연드림(영통점)
468최경호영업784-85-02291코리아세븐(영통경희대점)
469윤희창영업124-12-39274포유25
470이제훈영업220-81-60348홈플러스익스프레스(영통점)