Overview

Dataset statistics

Number of variables: 3
Number of observations: 52
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): 3.8%
Total size in memory: 1.3 KiB
Average record size in memory: 26.5 B
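
The percentage and per-record figures above follow from the raw counts. A minimal Python sketch of that arithmetic is given here; the byte total is back-computed from the reported average (26.5 B per record over 52 records), which is an assumption, since the report itself only shows the rounded 1.3 KiB.

```python
# Cross-check of the derived dataset statistics reported above.
n_observations = 52
n_duplicate_rows = 2
avg_record_bytes = 26.5  # reported average record size in memory

# Share of duplicate rows as a percentage, rounded to one decimal place.
duplicate_pct = round(n_duplicate_rows / n_observations * 100, 1)

# Assumption: the total is back-computed from the reported average.
total_bytes = avg_record_bytes * n_observations
total_kib = round(total_bytes / 1024, 1)

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Total size in memory: {total_kib} KiB")
```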

Variable types

Categorical: 1
Text: 2

Dataset

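Description: Status of business-site waste reports for Seongdong-gu, Seoul. The dataset includes the name of each reporting business (사업장명), its address (주소), and the autonomous district it belongs to (자치구).
URL: https://www.data.go.kr/data/15062097/fileData.do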

Alerts

자치구 has constant value "성동구" (Constant)
Dataset has 2 (3.8%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-11 23:28:02.815229
Analysis finished: 2023-12-11 23:28:03.096591
Duration: 0.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
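
A report like this one could be regenerated roughly as follows. This is a sketch, not the original configuration: the function name `build_report`, the CSV path, the encoding, and the report title are all hypothetical, and it assumes `pandas` and `ydata-profiling` are installed.

```python
def build_report(csv_path, output_html="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (illustrative sketch).

    csv_path, output_html and encoding are hypothetical placeholders. The
    source file may use a different encoding (for example CP949), in which
    case the encoding argument needs to be changed.
    """
    # Imported lazily so the function can be defined without the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")  # hypothetical title
    report.to_file(output_html)
    return report
```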

Variables

자치구
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B
성동구 | 52

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 성동구
2nd row: 성동구
3rd row: 성동구
4th row: 성동구
5th row: 성동구

Common Values

Value | Count | Frequency (%)
성동구 | 52 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
성동구 | 52 | 100.0%

사업장명
Text

Distinct: 50
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 25
Median length: 12.5
Mean length: 9.25
Min length: 3

Characters and Unicode

Total characters: 481
Distinct characters: 165
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
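
As an illustration of how such character properties can be read, the sketch below counts Unicode general categories for one business name from this dataset using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical. The module returns two-letter codes rather than the long names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation. The standard library does not expose script or block names, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general-category codes (e.g. 'Lo', 'Ps') in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# One business name taken from the dataset's sample rows.
counts = category_counts("(주)도성개발")
print(counts)
```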

Unique

Unique: 48
Unique (%): 92.3%
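
The figures for this variable are consistent with Distinct counting every different value and Unique counting only values that occur exactly once: 48 values occurring once plus 2 values occurring twice gives 50 distinct values over 52 rows. A minimal sketch of that distinction, using a short illustrative list of names drawn from this dataset (the helper name `distinct_and_unique` is hypothetical):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Illustrative subset: one name repeated, two names occurring once.
names = ["한양대학교", "한양대학교", "(주)예스코", "호산테크닉"]
n_distinct, n_unique = distinct_and_unique(names)
print(n_distinct, n_unique)
```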

Sample

1st row: (주)도성개발
2nd row: 성원이앤아이(주)
3rd row: 만강건설(주)
4th row: 주식회사 넥스트키친
5th row: (주)로얄테크
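
Length statistics like those reported for this variable can be computed directly from the strings; the reported mean of 9.25, for instance, equals 481 total characters over 52 rows. The sketch below uses only the five sample rows above, so its results differ from the full-column figures.

```python
from statistics import mean, median

# The five sample rows shown above (not the full 52-row column).
sample_names = [
    "(주)도성개발",
    "성원이앤아이(주)",
    "만강건설(주)",
    "주식회사 넥스트키친",
    "(주)로얄테크",
]

# len() counts code points, so each Hangul syllable counts as one character.
lengths = [len(name) for name in sample_names]
length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```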
Value | Count | Frequency (%)
주식회사 | 4 | 6.0%
주)신세계푸드 | 3 | 4.5%
한양대학교 | 2 | 3.0%
메인주방 | 2 | 3.0%
군자차량사업소 | 2 | 3.0%
시설관리공단 | 1 | 1.5%
주-한국아스텐엔지니어링 | 1 | 1.5%
주)도성개발 | 1 | 1.5%
경남식품 | 1 | 1.5%
주)태양미트고기선수촌 | 1 | 1.5%
Other values (49) | 49 | 73.1%

Most occurring characters

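Value | Count | Frequency (%)
(glyph lost) | 38 | 7.9%
( | 30 | 6.2%
) | 30 | 6.2%
(space) | 15 | 3.1%
(glyph lost) | 12 | 2.5%
(glyph lost) | 11 | 2.3%
(glyph lost) | 9 | 1.9%
(glyph lost) | 9 | 1.9%
(glyph lost) | 8 | 1.7%
(glyph lost) | 8 | 1.7%
Other values (155) | 311 | 64.7%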

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 400 | 83.2%
Open Punctuation | 30 | 6.2%
Close Punctuation | 30 | 6.2%
Space Separator | 15 | 3.1%
Uppercase Letter | 2 | 0.4%
Other Punctuation | 1 | 0.2%
Dash Punctuation | 1 | 0.2%
Decimal Number | 1 | 0.2%
Lowercase Letter | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 38 | 9.5%
(glyph lost) | 12 | 3.0%
(glyph lost) | 11 | 2.8%
(glyph lost) | 9 | 2.2%
(glyph lost) | 9 | 2.2%
(glyph lost) | 8 | 2.0%
(glyph lost) | 8 | 2.0%
(glyph lost) | 7 | 1.8%
(glyph lost) | 6 | 1.5%
(glyph lost) | 6 | 1.5%
Other values (146) | 286 | 71.5%

Uppercase Letter
Value | Count | Frequency (%)
R | 1 | 50.0%
D | 1 | 50.0%

Open Punctuation
Value | Count | Frequency (%)
( | 30 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 30 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 15 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 1 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
k | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 400 | 83.2%
Common | 78 | 16.2%
Latin | 3 | 0.6%

Most frequent character per script

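Hangul
Value | Count | Frequency (%)
(glyph lost) | 38 | 9.5%
(glyph lost) | 12 | 3.0%
(glyph lost) | 11 | 2.8%
(glyph lost) | 9 | 2.2%
(glyph lost) | 9 | 2.2%
(glyph lost) | 8 | 2.0%
(glyph lost) | 8 | 2.0%
(glyph lost) | 7 | 1.8%
(glyph lost) | 6 | 1.5%
(glyph lost) | 6 | 1.5%
Other values (146) | 286 | 71.5%

Common
Value | Count | Frequency (%)
( | 30 | 38.5%
) | 30 | 38.5%
(space) | 15 | 19.2%
& | 1 | 1.3%
- | 1 | 1.3%
1 | 1 | 1.3%

Latin
Value | Count | Frequency (%)
R | 1 | 33.3%
D | 1 | 33.3%
k | 1 | 33.3%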

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 400 | 83.2%
ASCII | 81 | 16.8%

Most frequent character per block

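Hangul
Value | Count | Frequency (%)
(glyph lost) | 38 | 9.5%
(glyph lost) | 12 | 3.0%
(glyph lost) | 11 | 2.8%
(glyph lost) | 9 | 2.2%
(glyph lost) | 9 | 2.2%
(glyph lost) | 8 | 2.0%
(glyph lost) | 8 | 2.0%
(glyph lost) | 7 | 1.8%
(glyph lost) | 6 | 1.5%
(glyph lost) | 6 | 1.5%
Other values (146) | 286 | 71.5%

ASCII
Value | Count | Frequency (%)
( | 30 | 37.0%
) | 30 | 37.0%
(space) | 15 | 18.5%
R | 1 | 1.2%
& | 1 | 1.2%
D | 1 | 1.2%
- | 1 | 1.2%
1 | 1 | 1.2%
k | 1 | 1.2%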

주소
Text

Distinct: 48
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 39
Median length: 35
Mean length: 28.903846
Min length: 19

Characters and Unicode

Total characters: 1503
Distinct characters: 125
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 44
Unique (%): 84.6%

Sample

1st row: 서울특별시 성동구 천호대로 440 인원빌딩 (용답동)
2nd row: 서울특별시 성동구 가람길 173 (송정동)
3rd row: 서울특별시 성동구 가람길 113 후생관 (송정동)
4th row: 서울특별시 성동구 뚝섬로 403 (성수동2가)
5th row: 서울특별시 성동구 성수일로4길 25 서울숲코오롱디지털타워 (성수동2가)
Value | Count | Frequency (%)
서울특별시 | 52 | 17.8%
성동구 | 52 | 17.8%
마장동 | 12 | 4.1%
성수동2가 | 11 | 3.8%
용답동 | 6 | 2.1%
송정동 | 6 | 2.1%
1층 | 5 | 1.7%
7 | 5 | 1.7%
가람길 | 4 | 1.4%
성수동1가 | 4 | 1.4%
Other values (97) | 135 | 46.2%
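
A word-frequency table of this kind can be approximated by splitting each address on whitespace and counting the tokens. The sketch below is an assumption about the tokenization, not the profiler's actual rule: it strips ASCII punctuation from both ends of each token, which is why "(송정동)" is counted as "송정동". The helper name `word_counts` is hypothetical, and only the five sample addresses are used.

```python
import string
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens, trimming surrounding punctuation."""
    counts = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(string.punctuation)
            if token:
                counts[token] += 1
    return counts


# The five sample addresses shown for this variable.
sample_addresses = [
    "서울특별시 성동구 천호대로 440 인원빌딩 (용답동)",
    "서울특별시 성동구 가람길 173 (송정동)",
    "서울특별시 성동구 가람길 113 후생관 (송정동)",
    "서울특별시 성동구 뚝섬로 403 (성수동2가)",
    "서울특별시 성동구 성수일로4길 25 서울숲코오롱디지털타워 (성수동2가)",
]
address_words = word_counts(sample_addresses)
print(address_words.most_common(5))
```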

Most occurring characters

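Value | Count | Frequency (%)
(space) | 245 | 16.3%
(glyph lost) | 107 | 7.1%
(glyph lost) | 73 | 4.9%
(glyph lost) | 60 | 4.0%
(glyph lost) | 55 | 3.7%
(glyph lost) | 55 | 3.7%
(glyph lost) | 52 | 3.5%
(glyph lost) | 52 | 3.5%
(glyph lost) | 52 | 3.5%
( | 46 | 3.1%
Other values (115) | 706 | 47.0%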

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 965 | 64.2%
Space Separator | 245 | 16.3%
Decimal Number | 192 | 12.8%
Open Punctuation | 46 | 3.1%
Close Punctuation | 46 | 3.1%
Dash Punctuation | 6 | 0.4%
Uppercase Letter | 2 | 0.1%
Math Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 107 | 11.1%
(glyph lost) | 73 | 7.6%
(glyph lost) | 60 | 6.2%
(glyph lost) | 55 | 5.7%
(glyph lost) | 55 | 5.7%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 40 | 4.1%
(glyph lost) | 31 | 3.2%
Other values (98) | 388 | 40.2%

Decimal Number
Value | Count | Frequency (%)
1 | 43 | 22.4%
2 | 38 | 19.8%
3 | 23 | 12.0%
7 | 21 | 10.9%
5 | 16 | 8.3%
4 | 14 | 7.3%
0 | 14 | 7.3%
8 | 9 | 4.7%
9 | 7 | 3.6%
6 | 7 | 3.6%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 245 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 46 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 46 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 6 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 965 | 64.2%
Common | 536 | 35.7%
Latin | 2 | 0.1%

Most frequent character per script

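Hangul
Value | Count | Frequency (%)
(glyph lost) | 107 | 11.1%
(glyph lost) | 73 | 7.6%
(glyph lost) | 60 | 6.2%
(glyph lost) | 55 | 5.7%
(glyph lost) | 55 | 5.7%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 40 | 4.1%
(glyph lost) | 31 | 3.2%
Other values (98) | 388 | 40.2%

Common
Value | Count | Frequency (%)
(space) | 245 | 45.7%
( | 46 | 8.6%
) | 46 | 8.6%
1 | 43 | 8.0%
2 | 38 | 7.1%
3 | 23 | 4.3%
7 | 21 | 3.9%
5 | 16 | 3.0%
4 | 14 | 2.6%
0 | 14 | 2.6%
Other values (5) | 30 | 5.6%

Latin
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%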

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 965 | 64.2%
ASCII | 538 | 35.8%

Most frequent character per block

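ASCII
Value | Count | Frequency (%)
(space) | 245 | 45.5%
( | 46 | 8.6%
) | 46 | 8.6%
1 | 43 | 8.0%
2 | 38 | 7.1%
3 | 23 | 4.3%
7 | 21 | 3.9%
5 | 16 | 3.0%
4 | 14 | 2.6%
0 | 14 | 2.6%
Other values (7) | 32 | 5.9%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 107 | 11.1%
(glyph lost) | 73 | 7.6%
(glyph lost) | 60 | 6.2%
(glyph lost) | 55 | 5.7%
(glyph lost) | 55 | 5.7%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 52 | 5.4%
(glyph lost) | 40 | 4.1%
(glyph lost) | 31 | 3.2%
Other values (98) | 388 | 40.2%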

Correlations

Variable | 사업장명 | 주소
사업장명 | 1.000 | 1.000
주소 | 1.000 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Row | 자치구 | 사업장명 | 주소
0 | 성동구 | (주)도성개발 | 서울특별시 성동구 천호대로 440 인원빌딩 (용답동)
1 | 성동구 | 성원이앤아이(주) | 서울특별시 성동구 가람길 173 (송정동)
2 | 성동구 | 만강건설(주) | 서울특별시 성동구 가람길 113 후생관 (송정동)
3 | 성동구 | 주식회사 넥스트키친 | 서울특별시 성동구 뚝섬로 403 (성수동2가)
4 | 성동구 | (주)로얄테크 | 서울특별시 성동구 성수일로4길 25 서울숲코오롱디지털타워 (성수동2가)
5 | 성동구 | 세종씨앤피주식회사 | 서울특별시 성동구 아차산로 138 (성수동2가)
6 | 성동구 | 주식회사 솔마루미트 | 서울특별시 성동구 고산자로 330 B1층 (마장동)
7 | 성동구 | 주식회사 천일에너지 | 서울특별시 성동구 가람길 113 후생관 송정동 (송정동)
8 | 성동구 | (주)신세계푸드 메인주방 | 서울특별시 성동구 성수이로 51 서울숲한라시그마벨리 지하1층
9 | 성동구 | 적성건설(주) | 서울특별시 성동구 무학봉28길 5 (하왕십리동 상리제나우스)

Last rows

Row | 자치구 | 사업장명 | 주소
42 | 성동구 | (주)용진기전 | 서울특별시 성동구 자동차시장1길 70 장안평중고자동차시장 A동 3층
43 | 성동구 | 호산테크닉 | 서울특별시 성동구 아차산로5길 19-1 (성수동2가)
44 | 성동구 | 한국인터텍테스팅서비스(주) | 서울특별시 성동구 아차산로5길 7 1층 (성수동2가 아주디지털타워 )
45 | 성동구 | 한양대학교 | 서울특별시 성동구 왕십리로 222 한양대학교 (사근동)
46 | 성동구 | (주)동국금속 | 서울특별시 성동구 성수이로18길 44 1층 (성수동2가)
47 | 성동구 | 주성바렛연마 | 서울특별시 성동구 성수동1가 13-14
48 | 성동구 | (주)예스코 | 서울특별시 성동구 자동차시장길 23 (용답동 예스코)
49 | 성동구 | 서울메트로 군자차량사업소 | 서울특별시 성동구 천호대로78길 58 (용답동)
50 | 성동구 | 주-한국아스텐엔지니어링 | 서울특별시 성동구 자동차시장3길 64 (송정동)
51 | 성동구 | 서울시중랑물재생센터 | 서울특별시 성동구 자동차시장3길 64 (용답동 중랑물재생센터)

Duplicate rows

Most frequently occurring

Row | 자치구 | 사업장명 | 주소 | # duplicates
0 | 성동구 | (주)신세계푸드 메인주방 | 서울특별시 성동구 성수이로 51 서울숲한라시그마벨리 지하1층 | 2
1 | 성동구 | 한양대학교 | 서울특별시 성동구 왕십리로 222 한양대학교 (사근동) | 2
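
The table above lists two distinct rows that each occur twice. A grouping of this kind can be sketched by counting whole rows and keeping those seen more than once. The helper name `duplicate_groups` is hypothetical, and the three-row input is an illustrative subset of the dataset, not the full 52 rows.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# Illustrative subset: (자치구, 사업장명, 주소) tuples, one of them repeated.
rows = [
    ("성동구", "한양대학교", "서울특별시 성동구 왕십리로 222 한양대학교 (사근동)"),
    ("성동구", "(주)예스코", "서울특별시 성동구 자동차시장길 23 (용답동 예스코)"),
    ("성동구", "한양대학교", "서울특별시 성동구 왕십리로 222 한양대학교 (사근동)"),
]
groups = duplicate_groups(rows)
for row, n in groups.items():
    print(row[1], n)
```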