Overview

Dataset statistics

Number of variables4
Number of observations401
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.0 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Text2
DateTime1

Dataset

Description대구광역시 서구 내 폐수배출시설이 설치 되어 있는 현황에 대한 자료로 상호, 주소(도로명주소)의 항목을 제공합니다.
Author대구광역시 서구
URLhttps://www.data.go.kr/data/15084413/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 19:36:58.976042
Analysis finished2024-03-14 19:36:59.828395
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct401
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201
Minimum1
Maximum401
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2024-03-15T04:36:59.964452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21
Q1101
median201
Q3301
95-th percentile381
Maximum401
Range400
Interquartile range (IQR)200

Descriptive statistics

Standard deviation115.90298
Coefficient of variation (CV)0.57663172
Kurtosis-1.2
Mean201
Median Absolute Deviation (MAD)100
Skewness0
Sum80601
Variance13433.5
MonotonicityStrictly increasing
2024-03-15T04:37:00.424043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
265 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
272 1
 
0.2%
271 1
 
0.2%
270 1
 
0.2%
269 1
 
0.2%
268 1
 
0.2%
Other values (391) 391
97.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
401 1
0.2%
400 1
0.2%
399 1
0.2%
398 1
0.2%
397 1
0.2%
396 1
0.2%
395 1
0.2%
394 1
0.2%
393 1
0.2%
392 1
0.2%
Distinct393
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-03-15T04:37:01.264943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length20
Mean length6.723192
Min length2

Characters and Unicode

Total characters2696
Distinct characters284
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique387 ?
Unique (%)96.5%

Sample

1st row대동실업
2nd row(주)동진상사
3rd row(주)동화다이텍
4th row대하세차장
5th row(주)비케이글로벌
ValueCountFrequency (%)
주식회사 7
 
1.6%
dyetec연구원 4
 
0.9%
중앙모터스 3
 
0.7%
주)동아지질 2
 
0.5%
비산공장 2
 
0.5%
이현공장 2
 
0.5%
차고지 2
 
0.5%
현대다이텍(주 2
 
0.5%
주)미앤부티 2
 
0.5%
동덕섬유 2
 
0.5%
Other values (405) 407
93.6%
2024-03-15T04:37:02.469526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
197
 
7.3%
( 175
 
6.5%
) 175
 
6.5%
64
 
2.4%
62
 
2.3%
59
 
2.2%
55
 
2.0%
49
 
1.8%
44
 
1.6%
44
 
1.6%
Other values (274) 1772
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2213
82.1%
Open Punctuation 175
 
6.5%
Close Punctuation 175
 
6.5%
Uppercase Letter 85
 
3.2%
Space Separator 34
 
1.3%
Decimal Number 7
 
0.3%
Other Symbol 2
 
0.1%
Dash Punctuation 2
 
0.1%
Other Punctuation 2
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
197
 
8.9%
64
 
2.9%
62
 
2.8%
59
 
2.7%
55
 
2.5%
49
 
2.2%
44
 
2.0%
44
 
2.0%
43
 
1.9%
41
 
1.9%
Other values (242) 1555
70.3%
Uppercase Letter
ValueCountFrequency (%)
E 14
16.5%
T 12
14.1%
C 10
11.8%
D 7
 
8.2%
I 5
 
5.9%
Y 5
 
5.9%
M 4
 
4.7%
P 4
 
4.7%
H 3
 
3.5%
L 3
 
3.5%
Other values (11) 18
21.2%
Decimal Number
ValueCountFrequency (%)
2 4
57.1%
1 2
28.6%
4 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 175
100.0%
Close Punctuation
ValueCountFrequency (%)
) 175
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
r 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2215
82.2%
Common 395
 
14.7%
Latin 86
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
197
 
8.9%
64
 
2.9%
62
 
2.8%
59
 
2.7%
55
 
2.5%
49
 
2.2%
44
 
2.0%
44
 
2.0%
43
 
1.9%
41
 
1.9%
Other values (243) 1557
70.3%
Latin
ValueCountFrequency (%)
E 14
16.3%
T 12
14.0%
C 10
11.6%
D 7
 
8.1%
I 5
 
5.8%
Y 5
 
5.8%
M 4
 
4.7%
P 4
 
4.7%
H 3
 
3.5%
L 3
 
3.5%
Other values (12) 19
22.1%
Common
ValueCountFrequency (%)
( 175
44.3%
) 175
44.3%
34
 
8.6%
2 4
 
1.0%
1 2
 
0.5%
- 2
 
0.5%
& 1
 
0.3%
4 1
 
0.3%
. 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2213
82.1%
ASCII 481
 
17.8%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
197
 
8.9%
64
 
2.9%
62
 
2.8%
59
 
2.7%
55
 
2.5%
49
 
2.2%
44
 
2.0%
44
 
2.0%
43
 
1.9%
41
 
1.9%
Other values (242) 1555
70.3%
ASCII
ValueCountFrequency (%)
( 175
36.4%
) 175
36.4%
34
 
7.1%
E 14
 
2.9%
T 12
 
2.5%
C 10
 
2.1%
D 7
 
1.5%
I 5
 
1.0%
Y 5
 
1.0%
2 4
 
0.8%
Other values (21) 40
 
8.3%
None
ValueCountFrequency (%)
2
100.0%
Distinct387
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-03-15T04:37:03.550958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length25.099751
Min length20

Characters and Unicode

Total characters10065
Distinct characters81
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique374 ?
Unique (%)93.3%

Sample

1st row대구광역시 서구 염색공단중앙로 77 (비산동)
2nd row대구광역시 서구 염색공단천로7길 10 (비산동)
3rd row대구광역시 서구 염색공단천로 45 (비산동)
4th row대구광역시 서구 평리로35길 67 (중리동)
5th row대구광역시 서구 염색공단중앙로22길 15 (비산동)
ValueCountFrequency (%)
대구광역시 401
19.7%
서구 401
19.7%
비산동 172
 
8.5%
이현동 103
 
5.1%
중리동 78
 
3.8%
평리동 38
 
1.9%
달서천로 23
 
1.1%
염색공단로 21
 
1.0%
와룡로 21
 
1.0%
염색공단중앙로 18
 
0.9%
Other values (301) 758
37.3%
2024-03-15T04:37:05.469421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1662
 
16.5%
819
 
8.1%
440
 
4.4%
424
 
4.2%
404
 
4.0%
401
 
4.0%
401
 
4.0%
401
 
4.0%
( 401
 
4.0%
) 400
 
4.0%
Other values (71) 4312
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6130
60.9%
Space Separator 1662
 
16.5%
Decimal Number 1411
 
14.0%
Open Punctuation 401
 
4.0%
Close Punctuation 400
 
4.0%
Dash Punctuation 60
 
0.6%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
819
13.4%
440
 
7.2%
424
 
6.9%
404
 
6.6%
401
 
6.5%
401
 
6.5%
401
 
6.5%
399
 
6.5%
242
 
3.9%
196
 
3.2%
Other values (56) 2003
32.7%
Decimal Number
ValueCountFrequency (%)
1 340
24.1%
2 218
15.5%
3 169
12.0%
6 120
 
8.5%
7 118
 
8.4%
4 105
 
7.4%
8 89
 
6.3%
9 86
 
6.1%
0 86
 
6.1%
5 80
 
5.7%
Space Separator
ValueCountFrequency (%)
1662
100.0%
Open Punctuation
ValueCountFrequency (%)
( 401
100.0%
Close Punctuation
ValueCountFrequency (%)
) 400
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6130
60.9%
Common 3935
39.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
819
13.4%
440
 
7.2%
424
 
6.9%
404
 
6.6%
401
 
6.5%
401
 
6.5%
401
 
6.5%
399
 
6.5%
242
 
3.9%
196
 
3.2%
Other values (56) 2003
32.7%
Common
ValueCountFrequency (%)
1662
42.2%
( 401
 
10.2%
) 400
 
10.2%
1 340
 
8.6%
2 218
 
5.5%
3 169
 
4.3%
6 120
 
3.0%
7 118
 
3.0%
4 105
 
2.7%
8 89
 
2.3%
Other values (5) 313
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6130
60.9%
ASCII 3935
39.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1662
42.2%
( 401
 
10.2%
) 400
 
10.2%
1 340
 
8.6%
2 218
 
5.5%
3 169
 
4.3%
6 120
 
3.0%
7 118
 
3.0%
4 105
 
2.7%
8 89
 
2.3%
Other values (5) 313
 
8.0%
Hangul
ValueCountFrequency (%)
819
13.4%
440
 
7.2%
424
 
6.9%
404
 
6.6%
401
 
6.5%
401
 
6.5%
401
 
6.5%
399
 
6.5%
242
 
3.9%
196
 
3.2%
Other values (56) 2003
32.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2024-03-01 00:00:00
Maximum2024-03-01 00:00:00
2024-03-15T04:37:05.836848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:37:06.257696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T04:36:59.322637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-15T04:36:59.617564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T04:36:59.766295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명도로명소재지데이터기준일자
01대동실업대구광역시 서구 염색공단중앙로 77 (비산동)2024-03-01
12(주)동진상사대구광역시 서구 염색공단천로7길 10 (비산동)2024-03-01
23(주)동화다이텍대구광역시 서구 염색공단천로 45 (비산동)2024-03-01
34대하세차장대구광역시 서구 평리로35길 67 (중리동)2024-03-01
45(주)비케이글로벌대구광역시 서구 염색공단중앙로22길 15 (비산동)2024-03-01
56진호텍(주)대구광역시 서구 염색공단중앙로14길 12 (비산동)2024-03-01
67(주)무길염공대구광역시 서구 염색공단로 107 (비산동)2024-03-01
78유피엔대구광역시 서구 염색공단천로11길 10 (비산동)2024-03-01
89신창염직(주)대구광역시 서구 염색공단로 104 (비산동)2024-03-01
910(주)홍창대구광역시 서구 평리로33길 17 (중리동)2024-03-01
연번사업장명도로명소재지데이터기준일자
391392서구청 청소차 차고지대구광역시 서구 달서천로 16-6 (이현동)2024-03-01
392393(주)대구환경연구소대구광역시 서구 염색공단천로14길 12 (비산동)2024-03-01
393394(주)한국소방기구제작소대구광역시 서구 평리로29길 33 (중리동)2024-03-01
394395(주)동아지질대구광역시 서구 비산동 62-2 도로2024-03-01
395396(주)동아지질대구광역시 서구 비산동 498-1 도로2024-03-01
396397태룡상사대구광역시 서구 염색공단로 54 (비산동)2024-03-01
397398미스터(Mr)세차왕대구광역시 서구 원대로 59 (원대동1가)2024-03-01
398399아진TEX대구광역시 서구 달서천로 76 (평리동)2024-03-01
399400태산ENC대구광역시 서구 염색공단천로14길 16-11 외 1필지 (비산동)2024-03-01
400401평화카워시대구광역시 서구 평리로35길 14 (중리동)2024-03-01