Overview

Dataset statistics

Number of variables6
Number of observations57
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory51.3 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description대구광역시 재생플라스틱 취급업체 현황_20211122
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15035495&dataSetDetailId=150354951caedf14f178e&provdMethod=FILE

Alerts

연번 is highly overall correlated with 구군 and 1 other fieldsHigh correlation
구군 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
폐기물재활용업 허가 is highly overall correlated with 연번High correlation
영업대상폐기물 is highly overall correlated with 구군High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 19:06:15.471030
Analysis finished2023-12-10 19:06:16.284122
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29
Minimum1
Maximum57
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size645.0 B
2023-12-11T04:06:16.371079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.8
Q115
median29
Q343
95-th percentile54.2
Maximum57
Range56
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.598193
Coefficient of variation (CV)0.57235147
Kurtosis-1.2
Mean29
Median Absolute Deviation (MAD)14
Skewness0
Sum1653
Variance275.5
MonotonicityStrictly increasing
2023-12-11T04:06:16.547371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
44 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (47) 47
82.5%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
57 1
1.8%
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%

구군
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size588.0 B
달성군
28 
서구
12 
달서구
동구
북구

Length

Max length3
Median length3
Mean length2.6491228
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서구
2nd row서구
3rd row북구
4th row북구
5th row달성군

Common Values

ValueCountFrequency (%)
달성군 28
49.1%
서구 12
21.1%
달서구 9
 
15.8%
동구 5
 
8.8%
북구 3
 
5.3%

Length

2023-12-11T04:06:16.736815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:06:16.878154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달성군 28
49.1%
서구 12
21.1%
달서구 9
 
15.8%
동구 5
 
8.8%
북구 3
 
5.3%

폐기물재활용업 허가
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
종합재활용업
40 
중간재활용업
17 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중간재활용업
2nd row중간재활용업
3rd row중간재활용업
4th row중간재활용업
5th row중간재활용업

Common Values

ValueCountFrequency (%)
종합재활용업 40
70.2%
중간재활용업 17
29.8%

Length

2023-12-11T04:06:17.029391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:06:17.170422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종합재활용업 40
70.2%
중간재활용업 17
29.8%
Distinct55
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-11T04:06:17.443717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length7.122807
Min length2

Characters and Unicode

Total characters406
Distinct characters121
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)93.0%

Sample

1st row(주)해동자원 서대구공장
2nd row달구벌산업 ㈜대구공장
3rd row대호
4th row㈜해동자원 북구사업소
5th row(주)대광환경
ValueCountFrequency (%)
주식회사 3
 
4.4%
대호 2
 
2.9%
달성지점 2
 
2.9%
㈜유창알앤씨 2
 
2.9%
유유리싸이클링(주 1
 
1.5%
주)그린알앤이 1
 
1.5%
주)해성합섬 1
 
1.5%
용호그린(주 1
 
1.5%
주)해동자원 1
 
1.5%
서대구공장 1
 
1.5%
Other values (53) 53
77.9%
2023-12-11T04:06:17.926512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
5.4%
( 18
 
4.4%
) 18
 
4.4%
17
 
4.2%
14
 
3.4%
14
 
3.4%
12
 
3.0%
12
 
3.0%
11
 
2.7%
10
 
2.5%
Other values (111) 258
63.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 345
85.0%
Open Punctuation 18
 
4.4%
Close Punctuation 18
 
4.4%
Other Symbol 12
 
3.0%
Space Separator 11
 
2.7%
Uppercase Letter 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
6.4%
17
 
4.9%
14
 
4.1%
14
 
4.1%
12
 
3.5%
10
 
2.9%
10
 
2.9%
10
 
2.9%
8
 
2.3%
8
 
2.3%
Other values (105) 220
63.8%
Uppercase Letter
ValueCountFrequency (%)
H 1
50.0%
J 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 357
87.9%
Common 47
 
11.6%
Latin 2
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
6.2%
17
 
4.8%
14
 
3.9%
14
 
3.9%
12
 
3.4%
12
 
3.4%
10
 
2.8%
10
 
2.8%
10
 
2.8%
8
 
2.2%
Other values (106) 228
63.9%
Common
ValueCountFrequency (%)
( 18
38.3%
) 18
38.3%
11
23.4%
Latin
ValueCountFrequency (%)
H 1
50.0%
J 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 345
85.0%
ASCII 49
 
12.1%
None 12
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
6.4%
17
 
4.9%
14
 
4.1%
14
 
4.1%
12
 
3.5%
10
 
2.9%
10
 
2.9%
10
 
2.9%
8
 
2.3%
8
 
2.3%
Other values (105) 220
63.8%
ASCII
ValueCountFrequency (%)
( 18
36.7%
) 18
36.7%
11
22.4%
H 1
 
2.0%
J 1
 
2.0%
None
ValueCountFrequency (%)
12
100.0%
Distinct55
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-11T04:06:18.270727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length13.052632
Min length6

Characters and Unicode

Total characters744
Distinct characters73
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)93.0%

Sample

1st row염색공단천로16길17
2nd row북비산로29길40
3rd row조야로2길 101(서변동)
4th row조야로2길 103(서변동)
5th row하빈면 하빈남로 307-50
ValueCountFrequency (%)
논공읍 13
 
9.6%
하빈면 11
 
8.1%
하빈남로 4
 
3.0%
하산길 3
 
2.2%
안심로65길 3
 
2.2%
현풍서로 3
 
2.2%
현풍읍 3
 
2.2%
노이길 3
 
2.2%
82(갈산동 2
 
1.5%
비슬로264길 2
 
1.5%
Other values (84) 88
65.2%
2023-12-11T04:06:18.820539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
78
 
10.5%
1 51
 
6.9%
49
 
6.6%
43
 
5.8%
2 36
 
4.8%
26
 
3.5%
4 24
 
3.2%
5 22
 
3.0%
21
 
2.8%
6 20
 
2.7%
Other values (63) 374
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 383
51.5%
Decimal Number 234
31.5%
Space Separator 78
 
10.5%
Close Punctuation 17
 
2.3%
Open Punctuation 17
 
2.3%
Dash Punctuation 12
 
1.6%
Other Punctuation 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
12.8%
43
 
11.2%
26
 
6.8%
21
 
5.5%
19
 
5.0%
18
 
4.7%
17
 
4.4%
16
 
4.2%
15
 
3.9%
14
 
3.7%
Other values (48) 145
37.9%
Decimal Number
ValueCountFrequency (%)
1 51
21.8%
2 36
15.4%
4 24
10.3%
5 22
9.4%
6 20
 
8.5%
7 20
 
8.5%
3 19
 
8.1%
0 17
 
7.3%
9 13
 
5.6%
8 12
 
5.1%
Space Separator
ValueCountFrequency (%)
78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 383
51.5%
Common 361
48.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
12.8%
43
 
11.2%
26
 
6.8%
21
 
5.5%
19
 
5.0%
18
 
4.7%
17
 
4.4%
16
 
4.2%
15
 
3.9%
14
 
3.7%
Other values (48) 145
37.9%
Common
ValueCountFrequency (%)
78
21.6%
1 51
14.1%
2 36
10.0%
4 24
 
6.6%
5 22
 
6.1%
6 20
 
5.5%
7 20
 
5.5%
3 19
 
5.3%
) 17
 
4.7%
0 17
 
4.7%
Other values (5) 57
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 383
51.5%
ASCII 361
48.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
78
21.6%
1 51
14.1%
2 36
10.0%
4 24
 
6.6%
5 22
 
6.1%
6 20
 
5.5%
7 20
 
5.5%
3 19
 
5.3%
) 17
 
4.7%
0 17
 
4.7%
Other values (5) 57
15.8%
Hangul
ValueCountFrequency (%)
49
 
12.8%
43
 
11.2%
26
 
6.8%
21
 
5.5%
19
 
5.0%
18
 
4.7%
17
 
4.4%
16
 
4.2%
15
 
3.9%
14
 
3.7%
Other values (48) 145
37.9%

영업대상폐기물
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)28.1%
Missing0
Missing (%)0.0%
Memory size588.0 B
폐합성수지 등
28 
폐합성수지
12 
폐지류, 고철및금속캔류, 폐합성수지, 유리병
 
2
폐합성수지류(폐전선,폐통신케이블,고철)
 
2
폐합성수지류
 
2
Other values (11)
11 

Length

Max length38
Median length24
Mean length9.5087719
Min length5

Unique

Unique11 ?
Unique (%)19.3%

Sample

1st row폐합성수지, 폐가구 등
2nd row폐합성수지
3rd row폐지류, 고철및금속캔류, 폐합성수지, 유리병
4th row폐지류, 고철및금속캔류, 폐합성수지, 유리병
5th row폐합성수지 등

Common Values

ValueCountFrequency (%)
폐합성수지 등 28
49.1%
폐합성수지 12
21.1%
폐지류, 고철및금속캔류, 폐합성수지, 유리병 2
 
3.5%
폐합성수지류(폐전선,폐통신케이블,고철) 2
 
3.5%
폐합성수지류 2
 
3.5%
폐합성수지, 폐가구 등 1
 
1.8%
폐합성수지류(폐플라스틱,폐스티로폼) 1
 
1.8%
폐합성수지류(폐전선,폐통신케이블,폐모터,인쇄회로기관,고철,폐합성수지) 1
 
1.8%
폐합성수지류(PVC) 1
 
1.8%
폐목재, 폐합성수지 등 1
 
1.8%
Other values (6) 6
 
10.5%

Length

2023-12-11T04:06:19.040468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
폐합성수지 44
44.0%
31
31.0%
폐합성수지류 4
 
4.0%
폐지류 2
 
2.0%
고철및금속캔류 2
 
2.0%
유리병 2
 
2.0%
폐합성수지류(폐전선,폐통신케이블,고철 2
 
2.0%
pp 2
 
2.0%
폐합성수지류(플라스틱(pe 2
 
2.0%
폐합성수지류(폐플라스틱(pe 1
 
1.0%
Other values (8) 8
 
8.0%

Interactions

2023-12-11T04:06:15.938030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T04:06:19.147077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구군폐기물재활용업 허가업체명소재지영업대상폐기물
연번1.0000.9671.0000.8860.9750.721
구군0.9671.0000.3300.0001.0000.950
폐기물재활용업 허가1.0000.3301.0000.5421.0000.409
업체명0.8860.0000.5421.0000.9900.000
소재지0.9751.0001.0000.9901.0001.000
영업대상폐기물0.7210.9500.4090.0001.0001.000
2023-12-11T04:06:19.284990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물재활용업 허가영업대상폐기물구군
폐기물재활용업 허가1.0000.2700.390
영업대상폐기물0.2701.0000.751
구군0.3900.7511.000
2023-12-11T04:06:19.385048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구군폐기물재활용업 허가영업대상폐기물
연번1.0000.7130.9240.350
구군0.7131.0000.3900.751
폐기물재활용업 허가0.9240.3901.0000.270
영업대상폐기물0.3500.7510.2701.000

Missing values

2023-12-11T04:06:16.070170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T04:06:16.227250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구군폐기물재활용업 허가업체명소재지영업대상폐기물
01서구중간재활용업(주)해동자원 서대구공장염색공단천로16길17폐합성수지, 폐가구 등
12서구중간재활용업달구벌산업 ㈜대구공장북비산로29길40폐합성수지
23북구중간재활용업대호조야로2길 101(서변동)폐지류, 고철및금속캔류, 폐합성수지, 유리병
34북구중간재활용업㈜해동자원 북구사업소조야로2길 103(서변동)폐지류, 고철및금속캔류, 폐합성수지, 유리병
45달성군중간재활용업(주)대광환경하빈면 하빈남로 307-50폐합성수지 등
56달성군중간재활용업(주)디에이치환경논공읍 노이길 198폐합성수지 등
67달성군중간재활용업월드환경(주)현풍읍 현풍서로 102폐합성수지 등
78달성군중간재활용업JH산업옥포읍 원전1길 48폐합성수지 등
89달성군중간재활용업열린환경(주)현풍읍 현풍서로 559폐합성수지 등
910달성군중간재활용업(주)재림환경하빈면 달구벌대로12길 98-21폐합성수지 등
연번구군폐기물재활용업 허가업체명소재지영업대상폐기물
4748달성군종합재활용업(주)디엔와이 대구공장논공읍 논공중앙로 440폐합성수지 등
4849달성군종합재활용업대경수지재활용협동조합논공읍 노이4길 14폐합성수지 등
4950달성군종합재활용업다물환경 주식회사하빈면 하빈남로 374폐합성수지 등
5051달성군종합재활용업주식회사 대한실업 달성지점논공읍 논공로69길 7폐합성수지 등
5152달성군종합재활용업유유리싸이클링(주)하빈면 하산길 129-24폐합성수지 등
5253달성군종합재활용업(주)그린알앤이 달성지점하빈면 달구벌대로8길 110폐합성수지 등
5354달성군종합재활용업태현코리아하빈면 하빈남로 371-31, 371-33폐합성수지 등
5455달성군종합재활용업용호그린(주)하빈면 하빈남로104길 30-18폐합성수지 등
5556달성군종합재활용업(주)해성합섬논공읍 논공중앙로52길 22폐합성수지 등
5657달성군종합재활용업주식회사 브이제이케미칼논공읍 논공로87길 110폐합성수지 등