Overview

Dataset statistics

Number of variables7
Number of observations32
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory61.0 B

Variable types

Numeric1
Categorical2
Text4

Dataset

Description대구광역시_배출가스서비스업 정보_20210131
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15022053&dataSetDetailId=150220531d1ca6682085f&provdMethod=FILE

Alerts

연번 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업소명 has unique valuesUnique
소재지 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2024-04-20 21:29:04.124837
Analysis finished2024-04-20 21:29:05.511825
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.5
Minimum1
Maximum32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size416.0 B
2024-04-21T06:29:05.692754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.55
Q18.75
median16.5
Q324.25
95-th percentile30.45
Maximum32
Range31
Interquartile range (IQR)15.5

Descriptive statistics

Standard deviation9.3808315
Coefficient of variation (CV)0.56853524
Kurtosis-1.2
Mean16.5
Median Absolute Deviation (MAD)8
Skewness0
Sum528
Variance88
MonotonicityStrictly increasing
2024-04-21T06:29:06.092942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
1 1
 
3.1%
18 1
 
3.1%
32 1
 
3.1%
31 1
 
3.1%
30 1
 
3.1%
29 1
 
3.1%
28 1
 
3.1%
27 1
 
3.1%
26 1
 
3.1%
25 1
 
3.1%
Other values (22) 22
68.8%
ValueCountFrequency (%)
1 1
3.1%
2 1
3.1%
3 1
3.1%
4 1
3.1%
5 1
3.1%
6 1
3.1%
7 1
3.1%
8 1
3.1%
9 1
3.1%
10 1
3.1%
ValueCountFrequency (%)
32 1
3.1%
31 1
3.1%
30 1
3.1%
29 1
3.1%
28 1
3.1%
27 1
3.1%
26 1
3.1%
25 1
3.1%
24 1
3.1%
23 1
3.1%

지역
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Memory size384.0 B
대구시 달서구
13 
대구시 서구
대구시 북구
대구시 동구
대구시 달성군
Other values (2)

Length

Max length7
Median length7
Mean length6.5625
Min length6

Unique

Unique1 ?
Unique (%)3.1%

Sample

1st row대구시 동구
2nd row대구시 동구
3rd row대구시 동구
4th row대구시 서구
5th row대구시 서구

Common Values

ValueCountFrequency (%)
대구시 달서구 13
40.6%
대구시 서구 6
18.8%
대구시 북구 4
 
12.5%
대구시 동구 3
 
9.4%
대구시 달성군 3
 
9.4%
대구시 수성구 2
 
6.2%
대구시 남구 1
 
3.1%

Length

2024-04-21T06:29:06.380273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T06:29:06.581982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구시 32
50.0%
달서구 13
20.3%
서구 6
 
9.4%
북구 4
 
6.2%
동구 3
 
4.7%
달성군 3
 
4.7%
수성구 2
 
3.1%
남구 1
 
1.6%

업소명
Text

UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size384.0 B
2024-04-21T06:29:07.247787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.1875
Min length4

Characters and Unicode

Total characters262
Distinct characters99
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row 반야월자동차정비공장
2nd row스피드1급카센터
3rd row자동차도우미
4th row르노삼성자동차㈜대구사업소
5th row삼천리보림부란자
ValueCountFrequency (%)
대구서비스센터 2
 
5.4%
반야월자동차정비공장 1
 
2.7%
박윤자동차시스템 1
 
2.7%
한일1급정비 1
 
2.7%
현대남대구서비스 1
 
2.7%
호림자동차검사정비 1
 
2.7%
애니카랜드성서공단점 1
 
2.7%
범양카tbo플러스 1
 
2.7%
명륜카센터 1
 
2.7%
대곡종합카센타 1
 
2.7%
Other values (26) 26
70.3%
2024-04-21T06:29:08.159931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
5.7%
14
 
5.3%
12
 
4.6%
11
 
4.2%
11
 
4.2%
11
 
4.2%
10
 
3.8%
8
 
3.1%
8
 
3.1%
8
 
3.1%
Other values (89) 154
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 245
93.5%
Space Separator 6
 
2.3%
Decimal Number 5
 
1.9%
Other Symbol 3
 
1.1%
Uppercase Letter 3
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.1%
14
 
5.7%
12
 
4.9%
11
 
4.5%
11
 
4.5%
11
 
4.5%
10
 
4.1%
8
 
3.3%
8
 
3.3%
8
 
3.3%
Other values (83) 137
55.9%
Uppercase Letter
ValueCountFrequency (%)
T 1
33.3%
B 1
33.3%
O 1
33.3%
Space Separator
ValueCountFrequency (%)
6
100.0%
Decimal Number
ValueCountFrequency (%)
1 5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 248
94.7%
Common 11
 
4.2%
Latin 3
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
6.0%
14
 
5.6%
12
 
4.8%
11
 
4.4%
11
 
4.4%
11
 
4.4%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
Other values (84) 140
56.5%
Latin
ValueCountFrequency (%)
T 1
33.3%
B 1
33.3%
O 1
33.3%
Common
ValueCountFrequency (%)
6
54.5%
1 5
45.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 245
93.5%
ASCII 14
 
5.3%
None 3
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
6.1%
14
 
5.7%
12
 
4.9%
11
 
4.5%
11
 
4.5%
11
 
4.5%
10
 
4.1%
8
 
3.3%
8
 
3.3%
8
 
3.3%
Other values (83) 137
55.9%
ASCII
ValueCountFrequency (%)
6
42.9%
1 5
35.7%
T 1
 
7.1%
B 1
 
7.1%
O 1
 
7.1%
None
ValueCountFrequency (%)
3
100.0%
Distinct30
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size384.0 B
2024-04-21T06:29:08.781894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.40625
Min length3

Characters and Unicode

Total characters109
Distinct characters58
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)90.6%

Sample

1st row박종근
2nd row전병효
3rd row이성철
4th row대표이사
5th row이철호
ValueCountFrequency (%)
대표이사 3
 
8.6%
김만열 1
 
2.9%
손경희 1
 
2.9%
이후근 1
 
2.9%
김종만 1
 
2.9%
이정호 1
 
2.9%
배성호 1
 
2.9%
이상일 1
 
2.9%
박영배 1
 
2.9%
이도하 1
 
2.9%
Other values (23) 23
65.7%
2024-04-21T06:29:09.611156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
8.3%
6
 
5.5%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (48) 68
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 104
95.4%
Space Separator 3
 
2.8%
Decimal Number 1
 
0.9%
Other Punctuation 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
8.7%
6
 
5.8%
4
 
3.8%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (45) 63
60.6%
Space Separator
ValueCountFrequency (%)
3
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 104
95.4%
Common 5
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
8.7%
6
 
5.8%
4
 
3.8%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (45) 63
60.6%
Common
ValueCountFrequency (%)
3
60.0%
1 1
 
20.0%
, 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 104
95.4%
ASCII 5
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
8.7%
6
 
5.8%
4
 
3.8%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (45) 63
60.6%
ASCII
ValueCountFrequency (%)
3
60.0%
1 1
 
20.0%
, 1
 
20.0%

소재지
Text

UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size384.0 B
2024-04-21T06:29:10.377275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length18
Min length10

Characters and Unicode

Total characters576
Distinct characters85
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row 동구 안심로 55길 52 (동호동)
2nd row 동구 송라로 34
3rd row 동구 공항로 135(불로동)
4th row 서구 와룡로 392(중리동)
5th row 서구 와룡로87길 68,(이현동)
ValueCountFrequency (%)
달서구 13
 
11.8%
서구 6
 
5.5%
북구 4
 
3.6%
동구 3
 
2.7%
달성군 3
 
2.7%
북비산로 2
 
1.8%
비슬로 2
 
1.8%
진천동 2
 
1.8%
수성구 2
 
1.8%
팔달로 2
 
1.8%
Other values (69) 71
64.5%
2024-04-21T06:29:11.421955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
19.3%
32
 
5.6%
31
 
5.4%
30
 
5.2%
29
 
5.0%
( 27
 
4.7%
) 27
 
4.7%
1 19
 
3.3%
19
 
3.3%
3 18
 
3.1%
Other values (75) 233
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 307
53.3%
Space Separator 111
 
19.3%
Decimal Number 99
 
17.2%
Open Punctuation 27
 
4.7%
Close Punctuation 27
 
4.7%
Other Punctuation 4
 
0.7%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
10.4%
31
 
10.1%
30
 
9.8%
29
 
9.4%
19
 
6.2%
13
 
4.2%
10
 
3.3%
8
 
2.6%
7
 
2.3%
6
 
2.0%
Other values (59) 122
39.7%
Decimal Number
ValueCountFrequency (%)
1 19
19.2%
3 18
18.2%
9 11
11.1%
4 11
11.1%
5 10
10.1%
7 9
9.1%
2 7
 
7.1%
6 7
 
7.1%
8 5
 
5.1%
0 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
. 1
 
25.0%
Space Separator
ValueCountFrequency (%)
111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 307
53.3%
Common 269
46.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
10.4%
31
 
10.1%
30
 
9.8%
29
 
9.4%
19
 
6.2%
13
 
4.2%
10
 
3.3%
8
 
2.6%
7
 
2.3%
6
 
2.0%
Other values (59) 122
39.7%
Common
ValueCountFrequency (%)
111
41.3%
( 27
 
10.0%
) 27
 
10.0%
1 19
 
7.1%
3 18
 
6.7%
9 11
 
4.1%
4 11
 
4.1%
5 10
 
3.7%
7 9
 
3.3%
2 7
 
2.6%
Other values (6) 19
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 307
53.3%
ASCII 269
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111
41.3%
( 27
 
10.0%
) 27
 
10.0%
1 19
 
7.1%
3 18
 
6.7%
9 11
 
4.1%
4 11
 
4.1%
5 10
 
3.7%
7 9
 
3.3%
2 7
 
2.6%
Other values (6) 19
 
7.1%
Hangul
ValueCountFrequency (%)
32
 
10.4%
31
 
10.1%
30
 
9.8%
29
 
9.4%
19
 
6.2%
13
 
4.2%
10
 
3.3%
8
 
2.6%
7
 
2.3%
6
 
2.0%
Other values (59) 122
39.7%

전화번호
Text

UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size384.0 B
2024-04-21T06:29:12.099213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters256
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row964-0964
2nd row754-1006
3rd row382-5582
4th row653-3333
5th row563-0636
ValueCountFrequency (%)
964-0964 1
 
3.1%
754-1006 1
 
3.1%
563-9551 1
 
3.1%
615-4747 1
 
3.1%
581-0014 1
 
3.1%
591-1026 1
 
3.1%
587-5109 1
 
3.1%
633-0877 1
 
3.1%
583-7364 1
 
3.1%
636-2580 1
 
3.1%
Other values (22) 22
68.8%
2024-04-21T06:29:12.981327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 50
19.5%
- 32
12.5%
3 26
10.2%
6 23
9.0%
1 22
8.6%
4 21
8.2%
8 21
8.2%
0 19
 
7.4%
7 19
 
7.4%
2 12
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 224
87.5%
Dash Punctuation 32
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 50
22.3%
3 26
11.6%
6 23
10.3%
1 22
9.8%
4 21
9.4%
8 21
9.4%
0 19
 
8.5%
7 19
 
8.5%
2 12
 
5.4%
9 11
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 256
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 50
19.5%
- 32
12.5%
3 26
10.2%
6 23
9.0%
1 22
8.6%
4 21
8.2%
8 21
8.2%
0 19
 
7.4%
7 19
 
7.4%
2 12
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 256
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 50
19.5%
- 32
12.5%
3 26
10.2%
6 23
9.0%
1 22
8.6%
4 21
8.2%
8 21
8.2%
0 19
 
7.4%
7 19
 
7.4%
2 12
 
4.7%

측정항목
Categorical

Distinct2
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size384.0 B
CO,HC,λ,NOx,매연
17 
매연
15 

Length

Max length14
Median length14
Mean length8.375
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCO,HC,λ,NOx,매연
2nd row매연
3rd row매연
4th rowCO,HC,λ,NOx,매연
5th row매연

Common Values

ValueCountFrequency (%)
CO,HC,λ,NOx,매연 17
53.1%
매연 15
46.9%

Length

2024-04-21T06:29:13.218101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T06:29:13.398051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
co,hc,λ,nox,매연 17
53.1%
매연 15
46.9%

Interactions

2024-04-21T06:29:04.646777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T06:29:13.516446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역업소명대표자소재지전화번호측정항목
연번1.0000.8941.0000.9691.0001.0000.000
지역0.8941.0001.0000.9231.0001.0000.000
업소명1.0001.0001.0001.0001.0001.0001.000
대표자0.9690.9231.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.0001.000
측정항목0.0000.0001.0001.0001.0001.0001.000
2024-04-21T06:29:13.694562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정항목지역
측정항목1.0000.000
지역0.0001.000
2024-04-21T06:29:13.829561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역측정항목
연번1.0000.6890.000
지역0.6891.0000.000
측정항목0.0000.0001.000

Missing values

2024-04-21T06:29:04.983584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T06:29:05.367314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지역업소명대표자소재지전화번호측정항목
01대구시 동구반야월자동차정비공장박종근동구 안심로 55길 52 (동호동)964-0964CO,HC,λ,NOx,매연
12대구시 동구스피드1급카센터전병효동구 송라로 34754-1006매연
23대구시 동구자동차도우미이성철동구 공항로 135(불로동)382-5582매연
34대구시 서구르노삼성자동차㈜대구사업소대표이사서구 와룡로 392(중리동)653-3333CO,HC,λ,NOx,매연
45대구시 서구삼천리보림부란자이철호서구 와룡로87길 68,(이현동)563-0636매연
56대구시 서구오토1급 카센터박일랑서구 서대구로 225,(평리동)559-3510CO,HC,λ,NOx,매연
67대구시 서구경북1급종합정비곽동윤서구 북비산로 17길 12.(이현동)526-8155매연
78대구시 서구청구자동차 전문정비석용우서구 국채보상로 196,(평리동)551-9737CO,HC,λ,NOx,매연
89대구시 서구스피드메이트 서구비산점김진환서구 북비산로 371(비산동)525-5757CO,HC,λ,NOx,매연
910대구시 남구엑스레이싱신성일남구 중앙대로31길137764-3444매연
연번지역업소명대표자소재지전화번호측정항목
2223대구시 달서구명륜카센터윤경화달서구 야외음악당로47 (성당동)651-0802매연
2324대구시 달서구대곡종합카센타박상영달서구 미리샘길 7 (도원동)636-2580CO,HC,λ,NOx,매연
2425대구시 달서구스마일자동차정비이도하달서구 월암로 88 (월암동)583-7364매연
2526대구시 달서구청구1급카써비스박영배달서구 진천로 14 (진천동)633-0877CO,HC,λ,NOx,매연
2627대구시 달서구카매니져이상일달서구 성서4차첨단로 169(월암동)587-5109매연
2728대구시 달서구진모터스배성호달서구 성서서로 36길 3(갈산동)591-1026매연
2829대구시 달서구골든오토이정호달서구 성서4차첨단로111(대천동)581-0014매연
2930대구시 달성군창림종합정비공장김종만달성군 논공읍 비슬로 1947615-4747CO,HC,λ,NOx,매연
3031대구시 달성군애니카랜드다사점이후근달성군 다사읍 대실역북로 90563-9551매연
3132대구시 달성군스카이모터샵최종철달성군 화원읍 비슬로 2693636-0070매연