Overview

Dataset statistics

Number of variables5
Number of observations48
Missing cells5
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory43.8 B

Variable types

Numeric1
Text3
DateTime1

Dataset

Description인천광역시 서구 건설폐기물 수집운반업체 현황에 대한 데이터로 업체명, 주소, 연락처, 데이터 기준일자 등이 포함되어 있습니다.
Author인천광역시 서구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15090726&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
연락처 has 5 (10.4%) missing valuesMissing
연번 has unique valuesUnique
업체명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2024-01-28 11:01:24.171521
Analysis finished2024-01-28 11:01:24.579296
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.5
Minimum1
Maximum48
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-01-28T20:01:24.636188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.35
Q112.75
median24.5
Q336.25
95-th percentile45.65
Maximum48
Range47
Interquartile range (IQR)23.5

Descriptive statistics

Standard deviation14
Coefficient of variation (CV)0.57142857
Kurtosis-1.2
Mean24.5
Median Absolute Deviation (MAD)12
Skewness0
Sum1176
Variance196
MonotonicityStrictly increasing
2024-01-28T20:01:24.741585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
1 1
 
2.1%
26 1
 
2.1%
28 1
 
2.1%
29 1
 
2.1%
30 1
 
2.1%
31 1
 
2.1%
32 1
 
2.1%
33 1
 
2.1%
34 1
 
2.1%
35 1
 
2.1%
Other values (38) 38
79.2%
ValueCountFrequency (%)
1 1
2.1%
2 1
2.1%
3 1
2.1%
4 1
2.1%
5 1
2.1%
6 1
2.1%
7 1
2.1%
8 1
2.1%
9 1
2.1%
10 1
2.1%
ValueCountFrequency (%)
48 1
2.1%
47 1
2.1%
46 1
2.1%
45 1
2.1%
44 1
2.1%
43 1
2.1%
42 1
2.1%
41 1
2.1%
40 1
2.1%
39 1
2.1%

업체명
Text

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-28T20:01:24.940069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length7.3541667
Min length4

Characters and Unicode

Total characters353
Distinct characters101
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row(주)장형기업
2nd row(주)아이케이
3rd row재윤개발(주)
4th row(주)부성환경
5th row(주)이도
ValueCountFrequency (%)
주)장형기업 1
 
2.1%
주)아이케이 1
 
2.1%
일호산업(주 1
 
2.1%
주)한성기업 1
 
2.1%
주)유성엔텍 1
 
2.1%
조은토건(주 1
 
2.1%
주)리더로지스틱 1
 
2.1%
주)벧엘건설 1
 
2.1%
주)삼미재생산업 1
 
2.1%
태민공사 1
 
2.1%
Other values (38) 38
79.2%
2024-01-28T20:01:25.222796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 43
 
12.2%
) 43
 
12.2%
41
 
11.6%
12
 
3.4%
11
 
3.1%
11
 
3.1%
11
 
3.1%
9
 
2.5%
8
 
2.3%
7
 
2.0%
Other values (91) 157
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 263
74.5%
Open Punctuation 43
 
12.2%
Close Punctuation 43
 
12.2%
Uppercase Letter 2
 
0.6%
Other Symbol 1
 
0.3%
Space Separator 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
15.6%
12
 
4.6%
11
 
4.2%
11
 
4.2%
11
 
4.2%
9
 
3.4%
8
 
3.0%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (85) 140
53.2%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
D 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 264
74.8%
Common 87
 
24.6%
Latin 2
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
15.5%
12
 
4.5%
11
 
4.2%
11
 
4.2%
11
 
4.2%
9
 
3.4%
8
 
3.0%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (86) 141
53.4%
Common
ValueCountFrequency (%)
( 43
49.4%
) 43
49.4%
1
 
1.1%
Latin
ValueCountFrequency (%)
S 1
50.0%
D 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 263
74.5%
ASCII 89
 
25.2%
None 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 43
48.3%
) 43
48.3%
S 1
 
1.1%
D 1
 
1.1%
1
 
1.1%
Hangul
ValueCountFrequency (%)
41
 
15.6%
12
 
4.6%
11
 
4.2%
11
 
4.2%
11
 
4.2%
9
 
3.4%
8
 
3.0%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (85) 140
53.2%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2024-01-28T20:01:25.443057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length28.479167
Min length19

Characters and Unicode

Total characters1367
Distinct characters108
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 검단천로 203 (오류동)
2nd row인천광역시 서구 검단천로 151 (오류동)
3rd row인천광역시 서구 승학로 238 (심곡동)
4th row인천광역시 서구 왕길동 64-101, 1동
5th row인천광역시 서구 드림로 174 (백석동)
ValueCountFrequency (%)
인천광역시 48
 
17.3%
서구 48
 
17.3%
승학로 7
 
2.5%
검암동 7
 
2.5%
오류동 6
 
2.2%
1동 5
 
1.8%
완정로 5
 
1.8%
마전동 4
 
1.4%
158 4
 
1.4%
왕길동 4
 
1.4%
Other values (113) 139
50.2%
2024-01-28T20:01:25.768808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
230
 
16.8%
51
 
3.7%
1 51
 
3.7%
50
 
3.7%
48
 
3.5%
48
 
3.5%
48
 
3.5%
48
 
3.5%
48
 
3.5%
48
 
3.5%
Other values (98) 697
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 750
54.9%
Decimal Number 259
 
18.9%
Space Separator 230
 
16.8%
Other Punctuation 44
 
3.2%
Open Punctuation 39
 
2.9%
Close Punctuation 39
 
2.9%
Dash Punctuation 6
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
6.8%
50
 
6.7%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
46
 
6.1%
24
 
3.2%
Other values (83) 291
38.8%
Decimal Number
ValueCountFrequency (%)
1 51
19.7%
0 36
13.9%
2 29
11.2%
5 28
10.8%
6 24
9.3%
3 23
8.9%
4 21
8.1%
7 20
 
7.7%
9 14
 
5.4%
8 13
 
5.0%
Space Separator
ValueCountFrequency (%)
230
100.0%
Other Punctuation
ValueCountFrequency (%)
, 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 750
54.9%
Common 617
45.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
6.8%
50
 
6.7%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
46
 
6.1%
24
 
3.2%
Other values (83) 291
38.8%
Common
ValueCountFrequency (%)
230
37.3%
1 51
 
8.3%
, 44
 
7.1%
( 39
 
6.3%
) 39
 
6.3%
0 36
 
5.8%
2 29
 
4.7%
5 28
 
4.5%
6 24
 
3.9%
3 23
 
3.7%
Other values (5) 74
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 750
54.9%
ASCII 617
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
230
37.3%
1 51
 
8.3%
, 44
 
7.1%
( 39
 
6.3%
) 39
 
6.3%
0 36
 
5.8%
2 29
 
4.7%
5 28
 
4.5%
6 24
 
3.9%
3 23
 
3.7%
Other values (5) 74
 
12.0%
Hangul
ValueCountFrequency (%)
51
 
6.8%
50
 
6.7%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
48
 
6.4%
46
 
6.1%
24
 
3.2%
Other values (83) 291
38.8%

연락처
Text

MISSING 

Distinct43
Distinct (%)100.0%
Missing5
Missing (%)10.4%
Memory size516.0 B
2024-01-28T20:01:25.957104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters516
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row032-562-1658
2nd row032-563-3114
3rd row032-568-0194
4th row032-568-1009
5th row032-567-0181
ValueCountFrequency (%)
032-563-3114 1
 
2.3%
032-562-1658 1
 
2.3%
032-325-1608 1
 
2.3%
032-565-0071 1
 
2.3%
032-565-9110 1
 
2.3%
032-582-6811 1
 
2.3%
031-971-8831 1
 
2.3%
032-565-0541 1
 
2.3%
032-566-9750 1
 
2.3%
032-264-0075 1
 
2.3%
Other values (33) 33
76.7%
2024-01-28T20:01:26.225832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 86
16.7%
0 71
13.8%
2 68
13.2%
5 65
12.6%
3 63
12.2%
6 50
9.7%
1 33
 
6.4%
7 32
 
6.2%
8 20
 
3.9%
4 17
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 430
83.3%
Dash Punctuation 86
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 71
16.5%
2 68
15.8%
5 65
15.1%
3 63
14.7%
6 50
11.6%
1 33
7.7%
7 32
7.4%
8 20
 
4.7%
4 17
 
4.0%
9 11
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 516
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 86
16.7%
0 71
13.8%
2 68
13.2%
5 65
12.6%
3 63
12.2%
6 50
9.7%
1 33
 
6.4%
7 32
 
6.2%
8 20
 
3.9%
4 17
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 516
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 86
16.7%
0 71
13.8%
2 68
13.2%
5 65
12.6%
3 63
12.2%
6 50
9.7%
1 33
 
6.4%
7 32
 
6.2%
8 20
 
3.9%
4 17
 
3.3%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
Minimum2022-09-02 00:00:00
Maximum2022-09-02 00:00:00
2024-01-28T20:01:26.312572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T20:01:26.378636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-28T20:01:24.381953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T20:01:26.430530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명주소연락처
연번1.0001.0001.0001.000
업체명1.0001.0001.0001.000
주소1.0001.0001.0001.000
연락처1.0001.0001.0001.000

Missing values

2024-01-28T20:01:24.481477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T20:01:24.552218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명주소연락처데이터기준일자
01(주)장형기업인천광역시 서구 검단천로 203 (오류동)032-562-16582022-09-02
12(주)아이케이인천광역시 서구 검단천로 151 (오류동)032-563-31142022-09-02
23재윤개발(주)인천광역시 서구 승학로 238 (심곡동)032-568-01942022-09-02
34(주)부성환경인천광역시 서구 왕길동 64-101, 1동032-568-10092022-09-02
45(주)이도인천광역시 서구 드림로 174 (백석동)032-567-01812022-09-02
56엔지오종합개발(주)인천광역시 서구 승학로 426, 304호 (검암동, 우주프라자)032-569-45782022-09-02
67천리운송(주)인천광역시 서구 승학로495번길 4-1 (검암동)032-566-25222022-09-02
78진현토건(주)인천광역시 서구 중봉대로 799 (경서동)032-564-07052022-09-02
89승훈산업(주)인천광역시 서구 완정로146, 702호 (마전동)032-577-31462022-09-02
910(주)정암건설인천광역시 서구 독정로 6, 3층 (백석동)032-562-17702022-09-02
연번업체명주소연락처데이터기준일자
3839에스디(SD)개발인천광역시 서구 금곡동 678-17032-569-22532022-09-02
3940(주)에이원환경인천광역시 서구 완정로 158, 701호 (마전동, 중앙빌딩)032-562-75722022-09-02
4041아라개발(주)인천광역시 서구 승학로 551, 303-2호 (검암동, 동곡프라자)032-710-56772022-09-02
4142동아공사(주)인천광역시 서구 드림로 176 (백석동)032-565-07012022-09-02
4243(주)화승산업개발인천광역시 서구 봉오재3로 100, 5층 503호032-886-63012022-09-02
4344장형산업개발(주)인천광역시 서구 경명대로694번길 1, 201호 (공촌동)032-552-62472022-09-02
4445협신환경인천광역시 서구 원당대로 865, 504호(원당동, 대신프라자)<NA>2022-09-02
4546선진환경인천광역시 서구 검단로623번길7, 2층(마전동, 한중프라자)<NA>2022-09-02
4647㈜피에이치산업개발인천광역시 서구 염곡로498번안길 20-1, 501호 (가정동, 엠에스프라자)<NA>2022-09-02
4748충북자원인천광역시 서구 여우재로86번길 27,1층(가좌동)<NA>2022-09-02