Overview

Dataset statistics

Number of variables5
Number of observations44
Missing cells5
Missing cells (%)2.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory44.0 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description인천광역시 서구 건설폐기물 수집운반업체 현황에 대한 데이터로 업체명, 주소, 연락처, 데이터 기준일자 등이 포함되어 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15090726/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연락처 has 5 (11.4%) missing valuesMissing
연번 has unique valuesUnique
업체명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2024-04-17 10:20:05.378074
Analysis finished2024-04-17 10:20:05.753476
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.5
Minimum1
Maximum44
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size528.0 B
2024-04-17T19:20:05.817826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.15
Q111.75
median22.5
Q333.25
95-th percentile41.85
Maximum44
Range43
Interquartile range (IQR)21.5

Descriptive statistics

Standard deviation12.845233
Coefficient of variation (CV)0.57089923
Kurtosis-1.2
Mean22.5
Median Absolute Deviation (MAD)11
Skewness0
Sum990
Variance165
MonotonicityStrictly increasing
2024-04-17T19:20:05.938141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
1 1
 
2.3%
24 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
33 1
 
2.3%
Other values (34) 34
77.3%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
44 1
2.3%
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%

업체명
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-04-17T19:20:06.134875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length7.2954545
Min length4

Characters and Unicode

Total characters321
Distinct characters97
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row(주)장형기업
2nd row(주)아이케이
3rd row(주)부성환경
4th row(주)이도
5th row엔지오종합개발(주)
ValueCountFrequency (%)
주)장형기업 1
 
2.3%
주)아이케이 1
 
2.3%
일호산업(주 1
 
2.3%
주)유성엔텍 1
 
2.3%
조은토건(주 1
 
2.3%
주)리더로지스틱 1
 
2.3%
주)벧엘건설 1
 
2.3%
주)삼미재생산업 1
 
2.3%
태민공사 1
 
2.3%
강서환경 1
 
2.3%
Other values (34) 34
77.3%
2024-04-17T19:20:06.459098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 39
 
12.1%
) 39
 
12.1%
37
 
11.5%
11
 
3.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (87) 145
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 239
74.5%
Open Punctuation 39
 
12.1%
Close Punctuation 39
 
12.1%
Uppercase Letter 2
 
0.6%
Other Symbol 1
 
0.3%
Space Separator 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
15.5%
11
 
4.6%
10
 
4.2%
9
 
3.8%
9
 
3.8%
8
 
3.3%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
Other values (81) 128
53.6%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
D 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 240
74.8%
Common 79
 
24.6%
Latin 2
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
15.4%
11
 
4.6%
10
 
4.2%
9
 
3.8%
9
 
3.8%
8
 
3.3%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
Other values (82) 129
53.8%
Common
ValueCountFrequency (%)
( 39
49.4%
) 39
49.4%
1
 
1.3%
Latin
ValueCountFrequency (%)
S 1
50.0%
D 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 239
74.5%
ASCII 81
 
25.2%
None 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 39
48.1%
) 39
48.1%
S 1
 
1.2%
D 1
 
1.2%
1
 
1.2%
Hangul
ValueCountFrequency (%)
37
 
15.5%
11
 
4.6%
10
 
4.2%
9
 
3.8%
9
 
3.8%
8
 
3.3%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
Other values (81) 128
53.6%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-04-17T19:20:06.705618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length28.636364
Min length19

Characters and Unicode

Total characters1260
Distinct characters107
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 검단천로 203 (오류동)
2nd row인천광역시 서구 검단천로 151 (오류동)
3rd row인천광역시 서구 왕길동 64-101, 1동
4th row인천광역시 서구 드림로 174 (백석동)
5th row인천광역시 서구 승학로 426, 304호 (검암동, 우주프라자)
ValueCountFrequency (%)
인천광역시 44
 
17.3%
서구 44
 
17.3%
검암동 6
 
2.4%
오류동 6
 
2.4%
승학로 5
 
2.0%
완정로 5
 
2.0%
1동 5
 
2.0%
158 4
 
1.6%
마전동 4
 
1.6%
왕길동 4
 
1.6%
Other values (105) 127
50.0%
2024-04-17T19:20:07.083809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
210
 
16.7%
1 49
 
3.9%
47
 
3.7%
46
 
3.7%
44
 
3.5%
44
 
3.5%
44
 
3.5%
44
 
3.5%
44
 
3.5%
44
 
3.5%
Other values (97) 644
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 693
55.0%
Decimal Number 237
 
18.8%
Space Separator 210
 
16.7%
Other Punctuation 42
 
3.3%
Open Punctuation 36
 
2.9%
Close Punctuation 36
 
2.9%
Dash Punctuation 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
6.8%
46
 
6.6%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
42
 
6.1%
22
 
3.2%
Other values (82) 272
39.2%
Decimal Number
ValueCountFrequency (%)
1 49
20.7%
0 30
12.7%
2 27
11.4%
5 25
10.5%
6 23
9.7%
4 21
8.9%
7 20
8.4%
3 17
 
7.2%
9 13
 
5.5%
8 12
 
5.1%
Space Separator
ValueCountFrequency (%)
210
100.0%
Other Punctuation
ValueCountFrequency (%)
, 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 693
55.0%
Common 567
45.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
6.8%
46
 
6.6%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
42
 
6.1%
22
 
3.2%
Other values (82) 272
39.2%
Common
ValueCountFrequency (%)
210
37.0%
1 49
 
8.6%
, 42
 
7.4%
( 36
 
6.3%
) 36
 
6.3%
0 30
 
5.3%
2 27
 
4.8%
5 25
 
4.4%
6 23
 
4.1%
4 21
 
3.7%
Other values (5) 68
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 693
55.0%
ASCII 567
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
210
37.0%
1 49
 
8.6%
, 42
 
7.4%
( 36
 
6.3%
) 36
 
6.3%
0 30
 
5.3%
2 27
 
4.8%
5 25
 
4.4%
6 23
 
4.1%
4 21
 
3.7%
Other values (5) 68
 
12.0%
Hangul
ValueCountFrequency (%)
47
 
6.8%
46
 
6.6%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
44
 
6.3%
42
 
6.1%
22
 
3.2%
Other values (82) 272
39.2%

연락처
Text

MISSING 

Distinct39
Distinct (%)100.0%
Missing5
Missing (%)11.4%
Memory size484.0 B
2024-04-17T19:20:07.287170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters468
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)100.0%

Sample

1st row032-562-1658
2nd row032-563-3114
3rd row032-568-1009
4th row032-567-0181
5th row032-569-4578
ValueCountFrequency (%)
032-563-3114 1
 
2.6%
032-562-1658 1
 
2.6%
032-565-0071 1
 
2.6%
032-565-9110 1
 
2.6%
032-582-6811 1
 
2.6%
031-971-8831 1
 
2.6%
032-565-0541 1
 
2.6%
032-566-9750 1
 
2.6%
032-264-0075 1
 
2.6%
032-565-0701 1
 
2.6%
Other values (29) 29
74.4%
2024-04-17T19:20:07.558741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 78
16.7%
0 64
13.7%
2 64
13.7%
5 63
13.5%
3 55
11.8%
6 42
9.0%
1 31
 
6.6%
7 31
 
6.6%
4 16
 
3.4%
8 15
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 390
83.3%
Dash Punctuation 78
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 64
16.4%
2 64
16.4%
5 63
16.2%
3 55
14.1%
6 42
10.8%
1 31
7.9%
7 31
7.9%
4 16
 
4.1%
8 15
 
3.8%
9 9
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 78
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 468
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 78
16.7%
0 64
13.7%
2 64
13.7%
5 63
13.5%
3 55
11.8%
6 42
9.0%
1 31
 
6.6%
7 31
 
6.6%
4 16
 
3.4%
8 15
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 468
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 78
16.7%
0 64
13.7%
2 64
13.7%
5 63
13.5%
3 55
11.8%
6 42
9.0%
1 31
 
6.6%
7 31
 
6.6%
4 16
 
3.4%
8 15
 
3.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-10-26
44 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-26
2nd row2023-10-26
3rd row2023-10-26
4th row2023-10-26
5th row2023-10-26

Common Values

ValueCountFrequency (%)
2023-10-26 44
100.0%

Length

2024-04-17T19:20:07.675989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:20:07.750298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-26 44
100.0%

Interactions

2024-04-17T19:20:05.548531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T19:20:07.798559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명주소연락처
연번1.0001.0001.0001.000
업체명1.0001.0001.0001.000
주소1.0001.0001.0001.000
연락처1.0001.0001.0001.000

Missing values

2024-04-17T19:20:05.640643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:20:05.720221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명주소연락처데이터기준일자
01(주)장형기업인천광역시 서구 검단천로 203 (오류동)032-562-16582023-10-26
12(주)아이케이인천광역시 서구 검단천로 151 (오류동)032-563-31142023-10-26
23(주)부성환경인천광역시 서구 왕길동 64-101, 1동032-568-10092023-10-26
34(주)이도인천광역시 서구 드림로 174 (백석동)032-567-01812023-10-26
45엔지오종합개발(주)인천광역시 서구 승학로 426, 304호 (검암동, 우주프라자)032-569-45782023-10-26
56천리운송(주)인천광역시 서구 승학로495번길 4-1 (검암동)032-566-25222023-10-26
67진현토건(주)인천광역시 서구 중봉대로 799 (경서동)032-564-07052023-10-26
78승훈산업(주)인천광역시 서구 완정로146, 702호 (마전동)032-577-31462023-10-26
89(주)정암건설인천광역시 서구 독정로 6, 3층 (백석동)032-562-17702023-10-26
910(주)세기산업개발인천광역시 서구 염곡로 464번길 15, 907호 (가정동)032-529-44222023-10-26
연번업체명주소연락처데이터기준일자
3435(주)대건산업개발인천광역시 서구 두루물로96번길1, 2층032-563-57382023-10-26
3536에스디(SD)개발인천광역시 서구 금곡동 678-17032-569-22532023-10-26
3637(주)에이원환경인천광역시 서구 완정로 158, 701호 (마전동, 중앙빌딩)032-562-75722023-10-26
3738아라개발(주)인천광역시 서구 승학로 551, 303-2호 (검암동, 동곡프라자)032-710-56772023-10-26
3839동아공사(주)인천광역시 서구 드림로 176 (백석동)032-565-07012023-10-26
3940장형산업개발(주)인천광역시 서구 경명대로694번길 1, 201호 (공촌동)032-552-62472023-10-26
4041협신환경인천광역시 서구 원당대로 865, 504호(원당동, 대신프라자)<NA>2023-10-26
4142선진환경인천광역시 서구 검단로623번길7, 2층(마전동, 한중프라자)<NA>2023-10-26
4243㈜피에이치산업개발인천광역시 서구 염곡로498번안길 20-1, 501호 (가정동, 엠에스프라자)<NA>2023-10-26
4344충북자원인천광역시 서구 여우재로86번길 27,1층(가좌동)<NA>2023-10-26