Overview

Dataset statistics

Number of variables: 6
Number of observations: 56
Missing cells: 5
Missing cells (%): 1.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 50.4 B
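
The percentages in this block follow from the raw counts. The short check below is a sketch, not part of the report: it assumes that the missing-cell percentage is taken over all cells (variables times observations) and that the per-column figure for 전화번호 is taken over the number of observations. The variable names are made up for illustration.

```python
# Figures copied from the dataset statistics and the 전화번호 alert.
n_variables = 6
n_observations = 56
missing_cells = 5

# Assumption: "Missing cells (%)" is measured against every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the per-column percentage is measured against the row count.
phone_missing_pct = 100 * missing_cells / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing overall")
print(f"전화번호: {phone_missing_pct:.1f}% missing")
```

Under those assumptions the results round to the 1.5% and 8.9% shown in the report.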

Variable types

Text: 3
Categorical: 2
DateTime: 1

Dataset

Description: 대구광역시_북구_소음진동배출시설_20190910
Author: 대구광역시 북구
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15006283&dataSetDetailId=150062832b815679c2214_201909111352&provdMethod=FILE

Alerts

데이터기준일자 has constant value "2019-09-10" [Constant]
소음진동 is highly overall correlated with 지역구분 [High correlation]
지역구분 is highly overall correlated with 소음진동 [High correlation]
전화번호 has 5 (8.9%) missing values [Missing]
소재지도로명주소 has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 18:42:23.659619
Analysis finished: 2023-12-10 18:42:26.139575
Duration: 2.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명
Text

Distinct: 55
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 580.0 B

Length

Max length: 9
Median length: 4
Mean length: 4.9821429
Min length: 4

Characters and Unicode

Total characters: 279
Distinct characters: 96
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
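
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories in one of the sample values. It is not the profiler's own code. `category_counts` is a hypothetical helper, and the standard library exposes categories only, not scripts or blocks.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values listed for the first variable.
sample = "선미산업(주)"
counts = category_counts(sample)

# 'Lo' = Other Letter, 'Ps' = Open Punctuation, 'Pe' = Close Punctuation.
for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes correspond to the category names used in the tables: `Lo` is Other Letter, `Ps` and `Pe` are Open and Close Punctuation, `So` is Other Symbol, and `Zs` is Space Separator.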

Unique

Unique: 54
Unique (%): 96.4%

Sample

1st row: 성수직물공장
2nd row: 청안도금
3rd row: 대영플라스틱
4th row: 아주직물
5th row: 선미산업(주)

Value | Count | Frequency (%)
제일정공 | 2 | 3.5%
유영나염 | 1 | 1.8%
명심테크 | 1 | 1.8%
대광정밀부식공예 | 1 | 1.8%
㈜에이제트 | 1 | 1.8%
은성테크 | 1 | 1.8%
세진정공 | 1 | 1.8%
은성이엔티 | 1 | 1.8%
한국유체기술 | 1 | 1.8%
동양산업 | 1 | 1.8%
Other values (46) | 46 | 80.7%

Most occurring characters

Value | Count | Frequency (%)
 | 13 | 4.7%
 | 12 | 4.3%
 | 11 | 3.9%
 | 11 | 3.9%
 | 11 | 3.9%
 | 9 | 3.2%
 | 8 | 2.9%
 | 7 | 2.5%
 | 7 | 2.5%
) | 6 | 2.2%
Other values (86) | 184 | 65.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 263 | 94.3%
Close Punctuation | 6 | 2.2%
Open Punctuation | 6 | 2.2%
Other Symbol | 3 | 1.1%
Space Separator | 1 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 13 | 4.9%
 | 12 | 4.6%
 | 11 | 4.2%
 | 11 | 4.2%
 | 11 | 4.2%
 | 9 | 3.4%
 | 8 | 3.0%
 | 7 | 2.7%
 | 7 | 2.7%
 | 6 | 2.3%
Other values (82) | 168 | 63.9%

Close Punctuation
Value | Count | Frequency (%)
) | 6 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 6 | 100.0%

Other Symbol
Value | Count | Frequency (%)
 | 3 | 100.0%

Space Separator
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 266 | 95.3%
Common | 13 | 4.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 13 | 4.9%
 | 12 | 4.5%
 | 11 | 4.1%
 | 11 | 4.1%
 | 11 | 4.1%
 | 9 | 3.4%
 | 8 | 3.0%
 | 7 | 2.6%
 | 7 | 2.6%
 | 6 | 2.3%
Other values (83) | 171 | 64.3%

Common
Value | Count | Frequency (%)
) | 6 | 46.2%
( | 6 | 46.2%
 | 1 | 7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 263 | 94.3%
ASCII | 13 | 4.7%
None | 3 | 1.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 13 | 4.9%
 | 12 | 4.6%
 | 11 | 4.2%
 | 11 | 4.2%
 | 11 | 4.2%
 | 9 | 3.4%
 | 8 | 3.0%
 | 7 | 2.7%
 | 7 | 2.7%
 | 6 | 2.3%
Other values (82) | 168 | 63.9%

ASCII
Value | Count | Frequency (%)
) | 6 | 46.2%
( | 6 | 46.2%
 | 1 | 7.7%

None
Value | Count | Frequency (%)
 | 3 | 100.0%

소재지도로명주소
Text

UNIQUE

Distinct: 56
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 580.0 B

Length

Max length: 43
Median length: 29
Mean length: 25.5
Min length: 21

Characters and Unicode

Total characters: 1428
Distinct characters: 55
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 56
Unique (%): 100.0%

Sample

1st row: 대구광역시 북구 노원로42길 45 (침산동)
2nd row: 대구광역시 북구 침산남로7길 40 (침산동)
3rd row: 대구광역시 북구 칠성남로37길 22 (칠성동2가)
4th row: 대구광역시 북구 팔달로37길 14-6(노원동2가)
5th row: 대구광역시 북구 유통단지로8길 50(산격동)

Value | Count | Frequency (%)
대구광역시 | 56 | 20.7%
북구 | 56 | 20.7%
산격동 | 18 | 6.6%
침산동 | 14 | 5.2%
노원로42길 | 5 | 1.8%
검단공단로21길 | 5 | 1.8%
연암로42길 | 5 | 1.8%
검단북로 | 4 | 1.5%
검단동 | 4 | 1.5%
유통단지로3길 | 4 | 1.5%
Other values (84) | 100 | 36.9%

Most occurring characters

Value | Count | Frequency (%)
 | 215 | 15.1%
 | 112 | 7.8%
 | 69 | 4.8%
 | 60 | 4.2%
( | 58 | 4.1%
 | 57 | 4.0%
 | 57 | 4.0%
) | 57 | 4.0%
 | 56 | 3.9%
 | 56 | 3.9%
Other values (45) | 631 | 44.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 837 | 58.6%
Decimal Number | 236 | 16.5%
Space Separator | 215 | 15.1%
Open Punctuation | 58 | 4.1%
Close Punctuation | 57 | 4.0%
Dash Punctuation | 25 | 1.8%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 112 | 13.4%
 | 69 | 8.2%
 | 60 | 7.2%
 | 57 | 6.8%
 | 57 | 6.8%
 | 56 | 6.7%
 | 56 | 6.7%
 | 56 | 6.7%
 | 47 | 5.6%
 | 43 | 5.1%
Other values (31) | 224 | 26.8%

Decimal Number
Value | Count | Frequency (%)
1 | 55 | 23.3%
2 | 52 | 22.0%
4 | 29 | 12.3%
3 | 24 | 10.2%
5 | 18 | 7.6%
6 | 15 | 6.4%
7 | 14 | 5.9%
0 | 14 | 5.9%
8 | 9 | 3.8%
9 | 6 | 2.5%

Space Separator
Value | Count | Frequency (%)
 | 215 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 58 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 57 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 25 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 837 | 58.6%
Common | 591 | 41.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 112 | 13.4%
 | 69 | 8.2%
 | 60 | 7.2%
 | 57 | 6.8%
 | 57 | 6.8%
 | 56 | 6.7%
 | 56 | 6.7%
 | 56 | 6.7%
 | 47 | 5.6%
 | 43 | 5.1%
Other values (31) | 224 | 26.8%

Common
Value | Count | Frequency (%)
 | 215 | 36.4%
( | 58 | 9.8%
) | 57 | 9.6%
1 | 55 | 9.3%
2 | 52 | 8.8%
4 | 29 | 4.9%
- | 25 | 4.2%
3 | 24 | 4.1%
5 | 18 | 3.0%
6 | 15 | 2.5%
Other values (4) | 43 | 7.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 837 | 58.6%
ASCII | 591 | 41.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
 | 215 | 36.4%
( | 58 | 9.8%
) | 57 | 9.6%
1 | 55 | 9.3%
2 | 52 | 8.8%
4 | 29 | 4.9%
- | 25 | 4.2%
3 | 24 | 4.1%
5 | 18 | 3.0%
6 | 15 | 2.5%
Other values (4) | 43 | 7.3%

Hangul
Value | Count | Frequency (%)
 | 112 | 13.4%
 | 69 | 8.2%
 | 60 | 7.2%
 | 57 | 6.8%
 | 57 | 6.8%
 | 56 | 6.7%
 | 56 | 6.7%
 | 56 | 6.7%
 | 47 | 5.6%
 | 43 | 5.1%
Other values (31) | 224 | 26.8%

전화번호
Text

MISSING 

Distinct: 50
Distinct (%): 98.0%
Missing: 5
Missing (%): 8.9%
Memory size: 580.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 612
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 96.1%

Sample

1st row: 053-358-1188
2nd row: 053-352-2041
3rd row: 053-358-0750
4th row: 053-954-7979
5th row: 053-955-5525

Value | Count | Frequency (%)
053-351-0886 | 2 | 3.9%
053-352-8721 | 1 | 2.0%
053-359-2926 | 1 | 2.0%
053-358-1188 | 1 | 2.0%
053-359-0877 | 1 | 2.0%
053-357-0132 | 1 | 2.0%
053-357-4590 | 1 | 2.0%
053-382-0760 | 1 | 2.0%
053-356-4986 | 1 | 2.0%
053-954-4600 | 1 | 2.0%
Other values (40) | 40 | 78.4%

Most occurring characters

Value | Count | Frequency (%)
3 | 124 | 20.3%
5 | 105 | 17.2%
- | 102 | 16.7%
0 | 74 | 12.1%
8 | 47 | 7.7%
2 | 34 | 5.6%
1 | 33 | 5.4%
7 | 32 | 5.2%
9 | 23 | 3.8%
4 | 20 | 3.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 510 | 83.3%
Dash Punctuation | 102 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 124 | 24.3%
5 | 105 | 20.6%
0 | 74 | 14.5%
8 | 47 | 9.2%
2 | 34 | 6.7%
1 | 33 | 6.5%
7 | 32 | 6.3%
9 | 23 | 4.5%
4 | 20 | 3.9%
6 | 18 | 3.5%

Dash Punctuation
Value | Count | Frequency (%)
- | 102 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 612 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 124 | 20.3%
5 | 105 | 17.2%
- | 102 | 16.7%
0 | 74 | 12.1%
8 | 47 | 7.7%
2 | 34 | 5.6%
1 | 33 | 5.4%
7 | 32 | 5.2%
9 | 23 | 3.8%
4 | 20 | 3.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 612 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 124 | 20.3%
5 | 105 | 17.2%
- | 102 | 16.7%
0 | 74 | 12.1%
8 | 47 | 7.7%
2 | 34 | 5.6%
1 | 33 | 5.4%
7 | 32 | 5.2%
9 | 23 | 3.8%
4 | 20 | 3.3%

소음진동
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 580.0 B

Value | Count
소음 | 26
소음허가 | 24
소음진동 | 5
소음진동신고 | 1

Length

Max length: 6
Median length: 4
Mean length: 3.1071429
Min length: 2

Unique

Unique: 1
Unique (%): 1.8%

Sample

1st row: 소음허가
2nd row: 소음허가
3rd row: 소음허가
4th row: 소음허가
5th row: 소음허가

Common Values

Value | Count | Frequency (%)
소음 | 26 | 46.4%
소음허가 | 24 | 42.9%
소음진동 | 5 | 8.9%
소음진동신고 | 1 | 1.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values of 소음진동]

지역구분
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 8.9%
Missing: 0
Missing (%): 0.0%
Memory size: 580.0 B

Value | Count
준공업 | 30
준주거 | 17
일반주거 | 5
2종주거 | 2
자연녹지 | 2

Length

Max length: 4
Median length: 3
Mean length: 3.1607143
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 준주거
2nd row: 2종주거
3rd row: 준주거
4th row: 일반주거
5th row: 일반주거

Common Values

Value | Count | Frequency (%)
준공업 | 30 | 53.6%
준주거 | 17 | 30.4%
일반주거 | 5 | 8.9%
2종주거 | 2 | 3.6%
자연녹지 | 2 | 3.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values of 지역구분]

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 580.0 B
Minimum: 2019-09-10 00:00:00
Maximum: 2019-09-10 00:00:00

[Figure: Histogram with fixed size bins (bins=1)]

Correlations

 | 업소명 | 소재지도로명주소 | 전화번호 | 소음진동 | 지역구분
업소명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지도로명주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소음진동 | 1.000 | 1.000 | 1.000 | 1.000 | 0.623
지역구분 | 1.000 | 1.000 | 1.000 | 0.623 | 1.000

 | 소음진동 | 지역구분
소음진동 | 1.000 | 0.545
지역구분 | 0.545 | 1.000
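
The report does not state which association measure produced these coefficients. Assuming it is a chi-squared based measure such as Cramér's V, which is a common choice for pairs of categorical columns, a minimal standard-library sketch looks like the following. `cramers_v` is a hypothetical helper with no bias correction, and the ten pairs are only the first rows of the sample table, so the result is not expected to match the figures above.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# (소음진동, 지역구분) pairs for rows 0-9 of the sample table only.
pairs = [
    ("소음허가", "준주거"), ("소음허가", "2종주거"), ("소음허가", "준주거"),
    ("소음허가", "일반주거"), ("소음허가", "일반주거"), ("소음", "준공업"),
    ("소음", "준공업"), ("소음", "준공업"), ("소음허가", "일반주거"),
    ("소음허가", "2종주거"),
]
noise, zone = zip(*pairs)
print(round(cramers_v(noise, zone), 3))
```

A value near 1 means one column largely determines the other, and a value near 0 means the two look independent.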

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
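
To make the idea of nullity concrete, here is a small sketch that builds a boolean nullity matrix and per-column missing counts with plain Python. It illustrates the concept rather than the code behind the plots. The helper names are hypothetical, and the records are only two columns of the first five sample rows, with `None` standing in for `<NA>`.

```python
COLUMNS = ["업소명", "전화번호"]

# First five sample rows, reduced to two columns; None stands in for <NA>.
rows = [
    {"업소명": "성수직물공장", "전화번호": "053-358-1188"},
    {"업소명": "청안도금", "전화번호": "053-352-2041"},
    {"업소명": "대영플라스틱", "전화번호": None},
    {"업소명": "아주직물", "전화번호": "053-358-0750"},
    {"업소명": "선미산업(주)", "전화번호": "053-954-7979"},
]


def nullity_matrix(records, columns):
    """One row of booleans per record: True where the value is missing."""
    return [[rec.get(col) is None for col in columns] for rec in records]


def missing_per_column(records, columns):
    """Count missing values in each column."""
    matrix = nullity_matrix(records, columns)
    return {col: sum(row[i] for row in matrix) for i, col in enumerate(columns)}


counts = missing_per_column(rows, COLUMNS)
print(counts)
```

For these five rows only 전화번호 has a gap, in the third record.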

Sample

Index | 업소명 | 소재지도로명주소 | 전화번호 | 소음진동 | 지역구분 | 데이터기준일자
0 | 성수직물공장 | 대구광역시 북구 노원로42길 45 (침산동) | 053-358-1188 | 소음허가 | 준주거 | 2019-09-10
1 | 청안도금 | 대구광역시 북구 침산남로7길 40 (침산동) | 053-352-2041 | 소음허가 | 2종주거 | 2019-09-10
2 | 대영플라스틱 | 대구광역시 북구 칠성남로37길 22 (칠성동2가) | <NA> | 소음허가 | 준주거 | 2019-09-10
3 | 아주직물 | 대구광역시 북구 팔달로37길 14-6(노원동2가) | 053-358-0750 | 소음허가 | 일반주거 | 2019-09-10
4 | 선미산업(주) | 대구광역시 북구 유통단지로8길 50(산격동) | 053-954-7979 | 소음허가 | 일반주거 | 2019-09-10
5 | 영성직물공장 | 대구광역시 북구 검단공단로21길 54-36 (산격동) | 053-955-5525 | 소음 | 준공업 | 2019-09-10
6 | (주)화랑고무 | 대구광역시 북구 검단공단로21길 54-42 (산격동) | 053-382-7711 | 소음 | 준공업 | 2019-09-10
7 | 신광직물 | 대구광역시 북구 노원로39길 7(침산동) | 053-352-7077 | 소음 | 준공업 | 2019-09-10
8 | 동신연사 | 대구광역시 북구 침산남로 165-8 (침산동) | 053-357-1539 | 소음허가 | 일반주거 | 2019-09-10
9 | 세원물산(주) | 대구광역시 북구 칠곡중앙대로65길 21 (태전동) | 053-314-5161 | 소음허가 | 2종주거 | 2019-09-10

Index | 업소명 | 소재지도로명주소 | 전화번호 | 소음진동 | 지역구분 | 데이터기준일자
46 | 우방기계제작조 | 대구광역시 북구 검단북로2길 16-6 (산격동) | 053-383-9147 | 소음 | 준공업 | 2019-09-10
47 | 미래테크㈜ | 대구광역시 북구 검단북로11길 63(검단동) | 053-381-3427 | 소음진동 | 자연녹지 | 2019-09-10
48 | 하나정밀 | 대구광역시 북구 노원동로 10(노원동2가) | <NA> | 소음허가 | 준주거 | 2019-09-10
49 | 삼일코팅 | 대구광역시 북구 노원로42길 55(침산동) | 053-382-4353 | 소음허가 | 준주거 | 2019-09-10
50 | 에이스코팅 | 대구광역시 북구 연암로42길 42-4 (산격동) | 053-356-4422 | 소음진동신고 | 준공업 | 2019-09-10
51 | 태정정밀 | 대구광역시 북구 노원로42길 57-6 (침산동) | 053-358-1596 | 소음허가 | 준주거 | 2019-09-10
52 | 성진분체 | 대구광역시 북구 검단북로2길 24(산격동) | 053-359-2926 | 소음진동 | 준공업 | 2019-09-10
53 | 명신정공 | 대구광역시 북구 연암로42길 17-1(산격동) | 053-351-7539 | 소음 | 준공업 | 2019-09-10
54 | 경일섬유 | 대구광역시 북구 연암로42길 30(산격동) | 053-381-2257 | 소음 | 준공업 | 2019-09-10
55 | 성신정밀 | 대구광역시 북구 연암로42길 24-4(노원동3가) | <NA> | 소음 | 준공업 | 2019-09-10