Overview

Dataset statistics

Number of variables4
Number of observations79
Missing cells3
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory34.6 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description인천광역시 서구에 위치한 폐기물중간재활용 업체 현황에 관한 데이터셋입니다. 인천광역시 서구에 위치한 폐기물중간재활용 업체 현황의 상호, 주소에 관한 정보를 포함하고 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15091255/fileData.do

Alerts

연번 is highly overall correlated with 데이터기준일자High correlation
데이터기준일자 is highly overall correlated with 연번High correlation
데이터기준일자 is highly imbalanced (90.2%)Imbalance
연번 has 1 (1.3%) missing valuesMissing
상호 has 1 (1.3%) missing valuesMissing
주소 has 1 (1.3%) missing valuesMissing

Reproduction

Analysis started2024-03-14 14:42:50.563671
Analysis finished2024-03-14 14:42:51.544226
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct78
Distinct (%)100.0%
Missing1
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean43.025641
Minimum1
Maximum83
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size839.0 B
2024-03-14T23:42:51.678800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.7
Q121.5
median44.5
Q363.75
95-th percentile79.15
Maximum83
Range82
Interquartile range (IQR)42.25

Descriptive statistics

Standard deviation24.321961
Coefficient of variation (CV)0.56528991
Kurtosis-1.2278574
Mean43.025641
Median Absolute Deviation (MAD)21
Skewness-0.07281237
Sum3356
Variance591.55778
MonotonicityStrictly increasing
2024-03-14T23:42:51.946098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.3%
56 1
 
1.3%
63 1
 
1.3%
62 1
 
1.3%
61 1
 
1.3%
60 1
 
1.3%
59 1
 
1.3%
58 1
 
1.3%
57 1
 
1.3%
55 1
 
1.3%
Other values (68) 68
86.1%
ValueCountFrequency (%)
1 1
1.3%
2 1
1.3%
3 1
1.3%
4 1
1.3%
6 1
1.3%
7 1
1.3%
8 1
1.3%
9 1
1.3%
10 1
1.3%
11 1
1.3%
ValueCountFrequency (%)
83 1
1.3%
82 1
1.3%
81 1
1.3%
80 1
1.3%
79 1
1.3%
78 1
1.3%
77 1
1.3%
76 1
1.3%
75 1
1.3%
74 1
1.3%

상호
Text

MISSING 

Distinct78
Distinct (%)100.0%
Missing1
Missing (%)1.3%
Memory size760.0 B
2024-03-14T23:42:52.876168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length13
Mean length7.8461538
Min length3

Characters and Unicode

Total characters612
Distinct characters142
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)100.0%

Sample

1st row(주)에이원에코
2nd row성진산업
3rd row경진철강
4th row(주)케이윤산업
5th row대신자원개발
ValueCountFrequency (%)
그린에코넥서스(주 2
 
2.4%
주)에이원에코 1
 
1.2%
태주수지 1
 
1.2%
조양인더스트리(주 1
 
1.2%
주)준메탈 1
 
1.2%
인천지점 1
 
1.2%
주)청우이앤디 1
 
1.2%
주)청룡자원 1
 
1.2%
제이엔제이산업(주 1
 
1.2%
청룡시스템 1
 
1.2%
Other values (74) 74
87.1%
2024-03-14T23:42:54.054210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 54
 
8.8%
) 54
 
8.8%
52
 
8.5%
22
 
3.6%
16
 
2.6%
15
 
2.5%
14
 
2.3%
12
 
2.0%
12
 
2.0%
11
 
1.8%
Other values (132) 350
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 470
76.8%
Open Punctuation 54
 
8.8%
Close Punctuation 54
 
8.8%
Other Symbol 8
 
1.3%
Space Separator 7
 
1.1%
Lowercase Letter 7
 
1.1%
Other Punctuation 5
 
0.8%
Uppercase Letter 5
 
0.8%
Decimal Number 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
11.1%
22
 
4.7%
16
 
3.4%
15
 
3.2%
14
 
3.0%
12
 
2.6%
12
 
2.6%
11
 
2.3%
11
 
2.3%
11
 
2.3%
Other values (115) 294
62.6%
Lowercase Letter
ValueCountFrequency (%)
d 2
28.6%
o 2
28.6%
t 1
14.3%
c 1
14.3%
i 1
14.3%
Uppercase Letter
ValueCountFrequency (%)
L 2
40.0%
T 1
20.0%
E 1
20.0%
Y 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 4
80.0%
, 1
 
20.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 478
78.1%
Common 122
 
19.9%
Latin 12
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
10.9%
22
 
4.6%
16
 
3.3%
15
 
3.1%
14
 
2.9%
12
 
2.5%
12
 
2.5%
11
 
2.3%
11
 
2.3%
11
 
2.3%
Other values (116) 302
63.2%
Latin
ValueCountFrequency (%)
d 2
16.7%
o 2
16.7%
L 2
16.7%
T 1
8.3%
E 1
8.3%
Y 1
8.3%
t 1
8.3%
c 1
8.3%
i 1
8.3%
Common
ValueCountFrequency (%)
( 54
44.3%
) 54
44.3%
7
 
5.7%
. 4
 
3.3%
2 1
 
0.8%
1 1
 
0.8%
, 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 470
76.8%
ASCII 134
 
21.9%
None 8
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 54
40.3%
) 54
40.3%
7
 
5.2%
. 4
 
3.0%
d 2
 
1.5%
o 2
 
1.5%
L 2
 
1.5%
T 1
 
0.7%
E 1
 
0.7%
2 1
 
0.7%
Other values (6) 6
 
4.5%
Hangul
ValueCountFrequency (%)
52
 
11.1%
22
 
4.7%
16
 
3.4%
15
 
3.2%
14
 
3.0%
12
 
2.6%
12
 
2.6%
11
 
2.3%
11
 
2.3%
11
 
2.3%
Other values (115) 294
62.6%
None
ValueCountFrequency (%)
8
100.0%

주소
Text

MISSING 

Distinct78
Distinct (%)100.0%
Missing1
Missing (%)1.3%
Memory size760.0 B
2024-03-14T23:42:54.957372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length41
Mean length26.615385
Min length14

Characters and Unicode

Total characters2076
Distinct characters110
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 봉수대로1568번길 29(금곡동)
2nd row인천광역시 서구 두루물로 36(오류동)
3rd row인천광역시 서구 길무로 213(오류동)
4th row인천광역시 서구 원당대로 246번길 7(오류동)
5th row인천광역시 서구 금산로 9
ValueCountFrequency (%)
인천광역시 78
22.5%
서구 78
22.5%
두루물로 9
 
2.6%
금산로 5
 
1.4%
원당대로206번길 4
 
1.2%
두루물로96번길 3
 
0.9%
길무로 3
 
0.9%
봉수대로1568번길 3
 
0.9%
34(오류동 3
 
0.9%
서부자원순환특화산업단지 2
 
0.6%
Other values (139) 158
45.7%
2024-03-14T23:42:56.084070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
271
 
13.1%
91
 
4.4%
83
 
4.0%
81
 
3.9%
80
 
3.9%
80
 
3.9%
80
 
3.9%
78
 
3.8%
78
 
3.8%
78
 
3.8%
Other values (100) 1076
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1289
62.1%
Decimal Number 328
 
15.8%
Space Separator 271
 
13.1%
Close Punctuation 75
 
3.6%
Open Punctuation 75
 
3.6%
Dash Punctuation 17
 
0.8%
Other Punctuation 16
 
0.8%
Uppercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
7.1%
83
 
6.4%
81
 
6.3%
80
 
6.2%
80
 
6.2%
80
 
6.2%
78
 
6.1%
78
 
6.1%
78
 
6.1%
60
 
4.7%
Other values (80) 500
38.8%
Decimal Number
ValueCountFrequency (%)
1 55
16.8%
2 48
14.6%
6 39
11.9%
3 37
11.3%
5 35
10.7%
4 34
10.4%
8 26
7.9%
7 20
 
6.1%
0 18
 
5.5%
9 16
 
4.9%
Close Punctuation
ValueCountFrequency (%)
) 74
98.7%
] 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 74
98.7%
[ 1
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 14
87.5%
: 2
 
12.5%
Uppercase Letter
ValueCountFrequency (%)
A 4
80.0%
B 1
 
20.0%
Space Separator
ValueCountFrequency (%)
271
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1289
62.1%
Common 782
37.7%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
 
7.1%
83
 
6.4%
81
 
6.3%
80
 
6.2%
80
 
6.2%
80
 
6.2%
78
 
6.1%
78
 
6.1%
78
 
6.1%
60
 
4.7%
Other values (80) 500
38.8%
Common
ValueCountFrequency (%)
271
34.7%
) 74
 
9.5%
( 74
 
9.5%
1 55
 
7.0%
2 48
 
6.1%
6 39
 
5.0%
3 37
 
4.7%
5 35
 
4.5%
4 34
 
4.3%
8 26
 
3.3%
Other values (8) 89
 
11.4%
Latin
ValueCountFrequency (%)
A 4
80.0%
B 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1289
62.1%
ASCII 787
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
271
34.4%
) 74
 
9.4%
( 74
 
9.4%
1 55
 
7.0%
2 48
 
6.1%
6 39
 
5.0%
3 37
 
4.7%
5 35
 
4.4%
4 34
 
4.3%
8 26
 
3.3%
Other values (10) 94
 
11.9%
Hangul
ValueCountFrequency (%)
91
 
7.1%
83
 
6.4%
81
 
6.3%
80
 
6.2%
80
 
6.2%
80
 
6.2%
78
 
6.1%
78
 
6.1%
78
 
6.1%
60
 
4.7%
Other values (80) 500
38.8%

데이터기준일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size760.0 B
2023-10-26
78 
<NA>
 
1

Length

Max length10
Median length10
Mean length9.9240506
Min length4

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row2023-10-26
2nd row2023-10-26
3rd row2023-10-26
4th row2023-10-26
5th row2023-10-26

Common Values

ValueCountFrequency (%)
2023-10-26 78
98.7%
<NA> 1
 
1.3%

Length

2024-03-14T23:42:56.312750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:42:56.492723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-26 78
98.7%
na 1
 
1.3%

Interactions

2024-03-14T23:42:50.915742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T23:42:56.601145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호주소
연번1.0001.0001.000
상호1.0001.0001.000
주소1.0001.0001.000
2024-03-14T23:42:56.845281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번데이터기준일자
연번1.0001.000
데이터기준일자1.0001.000

Missing values

2024-03-14T23:42:51.138113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:42:51.284395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T23:42:51.444965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호주소데이터기준일자
01(주)에이원에코인천광역시 서구 봉수대로1568번길 29(금곡동)2023-10-26
12성진산업인천광역시 서구 두루물로 36(오류동)2023-10-26
23경진철강인천광역시 서구 길무로 213(오류동)2023-10-26
34(주)케이윤산업인천광역시 서구 원당대로 246번길 7(오류동)2023-10-26
46대신자원개발인천광역시 서구 금산로 92023-10-26
57유창테크인천광역시 서구 검단천로 153(오류동)2023-10-26
68(주)바로텍인천광역시 서구 가현산로46번안길 42 (대곡동)2023-10-26
79(주)이오니아이엔티(인천공장)인천광역시 서구 마중2로 14(오류동)2023-10-26
810(주)태초메탈인천광역시 서구 두루물로 78번길 26(오류동)2023-10-26
911(주)에코리사이클링인천광역시 서구 도담3로13(오류동)2023-10-26
연번상호주소데이터기준일자
6975청라환경개발(주)인천광역시 서구 금산로 15(경서동, 인천서부자원순환특화산업단지 A2-3)2023-10-26
7076부림인더스트리(주)인천광역시 서구 두루물로 96번길 262023-10-26
7177태영상사인천광역시 서구 금산로7번길23(경서동, 서부자원순환특화산업단지 A1블록 7,8로트)2023-10-26
7278㈜석천기업인천광역시 서구 검단천로 203(오류동)2023-10-26
7379(주)엔씨알인천광역시 서구 금산로 35 (경서동 1047-8, 서부자원순환특화단지)2023-10-26
7480경인케미컬인천광역시 서구 사월로 2(백석동)2023-10-26
7581우리환경개발㈜인천광역시 서구 금산로 28(경서동, 자원순환특화산업단지)2023-10-26
7682㈜동원에코인천광역시 서구 금산로 32(경서동, 자원순환특화산업단지)2023-10-26
7783정우메탈인천광역시 서구 파랑로116번안길 9 (원창동)2023-10-26
78<NA><NA><NA><NA>