Overview

Dataset statistics

Number of variables5
Number of observations52
Missing cells4
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory42.5 B

Variable types

Text3
Categorical2

Dataset

Description인천광역시 중구 폐기물 수집 및 운반업체에 대한 정보입니다. 파일명 인천광역시 중구 폐기물 수집 운반 업체 현황 내용 업소명, 영업대상 폐기물 등
URLhttps://www.data.go.kr/data/15075092/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 4 (7.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:07:34.443092
Analysis finished2023-12-12 12:07:34.879355
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct42
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T21:07:35.066318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.8269231
Min length3

Characters and Unicode

Total characters303
Distinct characters91
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)65.4%

Sample

1st row㈜은성개발
2nd row㈜대한공해엔지니어링
3rd row㈜은성개발
4th row㈜밸런스인더스트리
5th row㈜제이원개발
ValueCountFrequency (%)
인성코퍼레이션㈜ 3
 
5.8%
㈜고려환경 3
 
5.8%
㈜은성개발 2
 
3.8%
㈜대영알씨 2
 
3.8%
주)태산산업 2
 
3.8%
㈜중앙emc 2
 
3.8%
성석개발㈜ 2
 
3.8%
㈜밸런스인더스트리 2
 
3.8%
신성환경 1
 
1.9%
㈜아산개발 1
 
1.9%
Other values (32) 32
61.5%
2023-12-12T21:07:35.442769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
14.5%
13
 
4.3%
13
 
4.3%
13
 
4.3%
12
 
4.0%
11
 
3.6%
11
 
3.6%
10
 
3.3%
9
 
3.0%
6
 
2.0%
Other values (81) 161
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 247
81.5%
Other Symbol 44
 
14.5%
Lowercase Letter 4
 
1.3%
Decimal Number 2
 
0.7%
Open Punctuation 2
 
0.7%
Uppercase Letter 2
 
0.7%
Close Punctuation 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
5.3%
13
 
5.3%
13
 
5.3%
12
 
4.9%
11
 
4.5%
11
 
4.5%
10
 
4.0%
9
 
3.6%
6
 
2.4%
6
 
2.4%
Other values (74) 143
57.9%
Lowercase Letter
ValueCountFrequency (%)
m 2
50.0%
c 2
50.0%
Other Symbol
ValueCountFrequency (%)
44
100.0%
Decimal Number
ValueCountFrequency (%)
9 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
E 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 291
96.0%
Common 6
 
2.0%
Latin 6
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
15.1%
13
 
4.5%
13
 
4.5%
13
 
4.5%
12
 
4.1%
11
 
3.8%
11
 
3.8%
10
 
3.4%
9
 
3.1%
6
 
2.1%
Other values (75) 149
51.2%
Common
ValueCountFrequency (%)
9 2
33.3%
( 2
33.3%
) 2
33.3%
Latin
ValueCountFrequency (%)
E 2
33.3%
m 2
33.3%
c 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 247
81.5%
None 44
 
14.5%
ASCII 12
 
4.0%

Most frequent character per block

None
ValueCountFrequency (%)
44
100.0%
Hangul
ValueCountFrequency (%)
13
 
5.3%
13
 
5.3%
13
 
5.3%
12
 
4.9%
11
 
4.5%
11
 
4.5%
10
 
4.0%
9
 
3.6%
6
 
2.4%
6
 
2.4%
Other values (74) 143
57.9%
ASCII
ValueCountFrequency (%)
9 2
16.7%
( 2
16.7%
E 2
16.7%
m 2
16.7%
c 2
16.7%
) 2
16.7%
Distinct4
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size548.0 B
사업장배출시설계
26 
건설폐기물
13 
사업장비배출시설계
11 
생활폐기물
 
2

Length

Max length9
Median length8.5
Mean length7.3461538
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row생활폐기물
2nd row생활폐기물
3rd row사업장비배출시설계
4th row사업장비배출시설계
5th row사업장비배출시설계

Common Values

ValueCountFrequency (%)
사업장배출시설계 26
50.0%
건설폐기물 13
25.0%
사업장비배출시설계 11
21.2%
생활폐기물 2
 
3.8%

Length

2023-12-12T21:07:35.924115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:07:36.042113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장배출시설계 26
50.0%
건설폐기물 13
25.0%
사업장비배출시설계 11
21.2%
생활폐기물 2
 
3.8%

주소
Text

Distinct42
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T21:07:36.366214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length26
Mean length21.826923
Min length14

Characters and Unicode

Total characters1135
Distinct characters62
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)65.4%

Sample

1st row인천광역시 중구 서해대로 336, 신관605호
2nd row인천광역시 중구 서해대로 110
3rd row인천광역시 중구 서해대로 336, 신관605호
4th row인천광역시 중구 축항대로 249
5th row인천광역시 중구 우현로 34, 2층
ValueCountFrequency (%)
중구 52
21.5%
인천광역시 51
21.1%
축항대로 6
 
2.5%
2층 6
 
2.5%
1층 4
 
1.7%
6 4
 
1.7%
서해대로 4
 
1.7%
인항로 4
 
1.7%
3층 4
 
1.7%
12 3
 
1.2%
Other values (77) 104
43.0%
2023-12-12T21:07:36.877325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
195
17.2%
60
 
5.3%
59
 
5.2%
52
 
4.6%
52
 
4.6%
52
 
4.6%
51
 
4.5%
51
 
4.5%
51
 
4.5%
1 43
 
3.8%
Other values (52) 469
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 649
57.2%
Decimal Number 248
 
21.9%
Space Separator 195
 
17.2%
Other Punctuation 32
 
2.8%
Dash Punctuation 8
 
0.7%
Uppercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
9.2%
59
 
9.1%
52
 
8.0%
52
 
8.0%
52
 
8.0%
51
 
7.9%
51
 
7.9%
51
 
7.9%
30
 
4.6%
24
 
3.7%
Other values (37) 167
25.7%
Decimal Number
ValueCountFrequency (%)
1 43
17.3%
2 38
15.3%
4 30
12.1%
3 27
10.9%
0 23
9.3%
7 19
7.7%
9 18
7.3%
8 17
 
6.9%
6 17
 
6.9%
5 16
 
6.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
195
100.0%
Other Punctuation
ValueCountFrequency (%)
, 32
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 649
57.2%
Common 483
42.6%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
9.2%
59
 
9.1%
52
 
8.0%
52
 
8.0%
52
 
8.0%
51
 
7.9%
51
 
7.9%
51
 
7.9%
30
 
4.6%
24
 
3.7%
Other values (37) 167
25.7%
Common
ValueCountFrequency (%)
195
40.4%
1 43
 
8.9%
2 38
 
7.9%
, 32
 
6.6%
4 30
 
6.2%
3 27
 
5.6%
0 23
 
4.8%
7 19
 
3.9%
9 18
 
3.7%
8 17
 
3.5%
Other values (3) 41
 
8.5%
Latin
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 649
57.2%
ASCII 486
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
195
40.1%
1 43
 
8.8%
2 38
 
7.8%
, 32
 
6.6%
4 30
 
6.2%
3 27
 
5.6%
0 23
 
4.7%
7 19
 
3.9%
9 18
 
3.7%
8 17
 
3.5%
Other values (5) 44
 
9.1%
Hangul
ValueCountFrequency (%)
60
 
9.2%
59
 
9.1%
52
 
8.0%
52
 
8.0%
52
 
8.0%
51
 
7.9%
51
 
7.9%
51
 
7.9%
30
 
4.6%
24
 
3.7%
Other values (37) 167
25.7%

전화번호
Text

MISSING 

Distinct39
Distinct (%)81.2%
Missing4
Missing (%)7.7%
Memory size548.0 B
2023-12-12T21:07:37.136419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.0625
Min length12

Characters and Unicode

Total characters579
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)66.7%

Sample

1st row032-885-7922
2nd row032-884-3671
3rd row032-885-7922
4th row032-885-8725
5th row032-761-3731
ValueCountFrequency (%)
032-887-8026 3
 
6.2%
032-555-7971 3
 
6.2%
032-885-7922 2
 
4.2%
032-885-8725 2
 
4.2%
032-887-4053 2
 
4.2%
032-751-2100 2
 
4.2%
032-887-8878 2
 
4.2%
032-822-7266 1
 
2.1%
032-888-0038 1
 
2.1%
032-763-1199 1
 
2.1%
Other values (29) 29
60.4%
2023-12-12T21:07:37.565394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 96
16.6%
8 91
15.7%
2 81
14.0%
0 76
13.1%
3 67
11.6%
7 47
8.1%
1 32
 
5.5%
5 30
 
5.2%
6 20
 
3.5%
9 20
 
3.5%
Other values (2) 19
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 482
83.2%
Dash Punctuation 96
 
16.6%
Math Symbol 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 91
18.9%
2 81
16.8%
0 76
15.8%
3 67
13.9%
7 47
9.8%
1 32
 
6.6%
5 30
 
6.2%
6 20
 
4.1%
9 20
 
4.1%
4 18
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 96
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 579
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 96
16.6%
8 91
15.7%
2 81
14.0%
0 76
13.1%
3 67
11.6%
7 47
8.1%
1 32
 
5.5%
5 30
 
5.2%
6 20
 
3.5%
9 20
 
3.5%
Other values (2) 19
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 579
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 96
16.6%
8 91
15.7%
2 81
14.0%
0 76
13.1%
3 67
11.6%
7 47
8.1%
1 32
 
5.5%
5 30
 
5.2%
6 20
 
3.5%
9 20
 
3.5%
Other values (2) 19
 
3.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-07-27
52 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-27
2nd row2023-07-27
3rd row2023-07-27
4th row2023-07-27
5th row2023-07-27

Common Values

ValueCountFrequency (%)
2023-07-27 52
100.0%

Length

2023-12-12T21:07:37.749798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:07:37.844900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-27 52
100.0%

Correlations

2023-12-12T21:07:37.915794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명영업대상 폐기물주소전화번호
업소명1.0000.0001.0001.000
영업대상 폐기물0.0001.0000.0000.000
주소1.0000.0001.0000.998
전화번호1.0000.0000.9981.000

Missing values

2023-12-12T21:07:34.739450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:07:34.839943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명영업대상 폐기물주소전화번호데이터기준일자
0㈜은성개발생활폐기물인천광역시 중구 서해대로 336, 신관605호032-885-79222023-07-27
1㈜대한공해엔지니어링생활폐기물인천광역시 중구 서해대로 110032-884-36712023-07-27
2㈜은성개발사업장비배출시설계인천광역시 중구 서해대로 336, 신관605호032-885-79222023-07-27
3㈜밸런스인더스트리사업장비배출시설계인천광역시 중구 축항대로 249032-885-87252023-07-27
4㈜제이원개발사업장비배출시설계인천광역시 중구 우현로 34, 2층032-761-37312023-07-27
5경인환경산업㈜사업장비배출시설계인천광역시 중구 신포로23번길 49, 207호070-4064-31192023-07-27
6인성코퍼레이션㈜사업장비배출시설계인천광역시 중구 서해대로93번길 14-1, 3층032-555-79712023-07-27
7㈜고려환경사업장비배출시설계인천광역시 중구 서해대로454번길 12, 1층032-887-80262023-07-27
8친환경사업장비배출시설계인천광역시 중구 율목로 6-1,1층<NA>2023-07-27
9㈜대영알씨사업장비배출시설계인천광역시 중구 서해대로94번길 47032-887-88782023-07-27
업소명영업대상 폐기물주소전화번호데이터기준일자
42㈜밸런스인더스트리사업장배출시설계인천광역시 중구 축항대로 249032-885-87252023-07-27
43성석개발㈜사업장배출시설계인천광역시 중구 축항대로212번길 48032-887-40532023-07-27
44㈜영진운수사업장배출시설계인천광역시 중구 축항대로290번길 164, 103호032-764-32012023-07-27
45㈜해천물류사업장배출시설계인천광역시 중구 인항로 6, A동 509호032-884-33612023-07-27
46태왕㈜사업장배출시설계인천광역시 중구 신포로 8, 801호032-763-11992023-07-27
47(주)태산산업사업장비배출시설계인천광역시 중구 중산로 82032-751-21002023-07-27
48(주)태산산업사업장배출시설계인천광역시 중구 중산로 82032-751-21002023-07-27
49신성환경사업장비배출시설계인천광역시 중구 마장포로40번길 49032-751-34162023-07-27
50㈜99철거사업장배출시설계인천 중구 운중로 137-83032-751-44442023-07-27
51유민철강산업㈜사업장비배출시설계인천광역시 중구 영종대로196번길 15-7, 스카이탑 507호<NA>2023-07-27