Overview

Dataset statistics

Number of variables7
Number of observations22
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory63.0 B

Variable types

Numeric1
Categorical4
Text2

Dataset

Description경상북도 김천시의 대형 폐기물 수거 업체에 관한 정보로 업체명, 담당구역(행정동), 주소지, 전화번호 정보를 제공합니다.
Author경상북도 김천시
URLhttps://www.data.go.kr/data/15093158/fileData.do

Alerts

업체명 has constant value ""Constant
주소지 has constant value ""Constant
전화번호 has constant value ""Constant
데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique
담당구역(행정동) has unique valuesUnique
담당구역(법정동) has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:09:18.249661
Analysis finished2023-12-12 19:09:18.816481
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.5
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-13T04:09:18.883869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.05
Q16.25
median11.5
Q316.75
95-th percentile20.95
Maximum22
Range21
Interquartile range (IQR)10.5

Descriptive statistics

Standard deviation6.4935866
Coefficient of variation (CV)0.5646597
Kurtosis-1.2
Mean11.5
Median Absolute Deviation (MAD)5.5
Skewness0
Sum253
Variance42.166667
MonotonicityStrictly increasing
2023-12-13T04:09:19.011469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
1 1
 
4.5%
13 1
 
4.5%
22 1
 
4.5%
21 1
 
4.5%
20 1
 
4.5%
19 1
 
4.5%
18 1
 
4.5%
17 1
 
4.5%
16 1
 
4.5%
15 1
 
4.5%
Other values (12) 12
54.5%
ValueCountFrequency (%)
1 1
4.5%
2 1
4.5%
3 1
4.5%
4 1
4.5%
5 1
4.5%
6 1
4.5%
7 1
4.5%
8 1
4.5%
9 1
4.5%
10 1
4.5%
ValueCountFrequency (%)
22 1
4.5%
21 1
4.5%
20 1
4.5%
19 1
4.5%
18 1
4.5%
17 1
4.5%
16 1
4.5%
15 1
4.5%
14 1
4.5%
13 1
4.5%

업체명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
영남환경
22 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영남환경
2nd row영남환경
3rd row영남환경
4th row영남환경
5th row영남환경

Common Values

ValueCountFrequency (%)
영남환경 22
100.0%

Length

2023-12-13T04:09:19.161455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:09:19.278419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영남환경 22
100.0%
Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-13T04:09:19.493581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0454545
Min length2

Characters and Unicode

Total characters67
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row아포읍
2nd row농소면
3rd row남면
4th row개령면
5th row감문면
ValueCountFrequency (%)
아포읍 1
 
4.5%
농소면 1
 
4.5%
지좌동 1
 
4.5%
대곡동 1
 
4.5%
대신동 1
 
4.5%
양금동 1
 
4.5%
평화남산동 1
 
4.5%
자산동 1
 
4.5%
증산면 1
 
4.5%
대덕면 1
 
4.5%
Other values (12) 12
54.5%
2023-12-13T04:09:19.880328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%
Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-13T04:09:20.095200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0454545
Min length2

Characters and Unicode

Total characters67
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row아포읍
2nd row농소면
3rd row남면
4th row개령면
5th row감문면
ValueCountFrequency (%)
아포읍 1
 
4.5%
농소면 1
 
4.5%
지좌동 1
 
4.5%
대곡동 1
 
4.5%
대신동 1
 
4.5%
양금동 1
 
4.5%
평화남산동 1
 
4.5%
자산동 1
 
4.5%
증산면 1
 
4.5%
대덕면 1
 
4.5%
Other values (12) 12
54.5%
2023-12-13T04:09:20.529110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
20.9%
7
 
10.4%
4
 
6.0%
4
 
6.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (27) 27
40.3%

주소지
Categorical

CONSTANT 

Distinct1
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
김천시 영남대로 1949-2
22 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row김천시 영남대로 1949-2
2nd row김천시 영남대로 1949-2
3rd row김천시 영남대로 1949-2
4th row김천시 영남대로 1949-2
5th row김천시 영남대로 1949-2

Common Values

ValueCountFrequency (%)
김천시 영남대로 1949-2 22
100.0%

Length

2023-12-13T04:09:20.681487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:09:20.803975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
김천시 22
33.3%
영남대로 22
33.3%
1949-2 22
33.3%

전화번호
Categorical

CONSTANT 

Distinct1
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
054-435-8488
22 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row054-435-8488
2nd row054-435-8488
3rd row054-435-8488
4th row054-435-8488
5th row054-435-8488

Common Values

ValueCountFrequency (%)
054-435-8488 22
100.0%

Length

2023-12-13T04:09:20.929264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:09:21.076080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
054-435-8488 22
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
2021-10-21
22 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-10-21
2nd row2021-10-21
3rd row2021-10-21
4th row2021-10-21
5th row2021-10-21

Common Values

ValueCountFrequency (%)
2021-10-21 22
100.0%

Length

2023-12-13T04:09:21.216768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:09:21.338172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-10-21 22
100.0%

Interactions

2023-12-13T04:09:18.432863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:09:21.422371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번담당구역(행정동)담당구역(법정동)
연번1.0001.0001.000
담당구역(행정동)1.0001.0001.000
담당구역(법정동)1.0001.0001.000

Missing values

2023-12-13T04:09:18.583984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:09:18.758482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명담당구역(행정동)담당구역(법정동)주소지전화번호데이터기준일자
01영남환경아포읍아포읍김천시 영남대로 1949-2054-435-84882021-10-21
12영남환경농소면농소면김천시 영남대로 1949-2054-435-84882021-10-21
23영남환경남면남면김천시 영남대로 1949-2054-435-84882021-10-21
34영남환경개령면개령면김천시 영남대로 1949-2054-435-84882021-10-21
45영남환경감문면감문면김천시 영남대로 1949-2054-435-84882021-10-21
56영남환경어모면어모면김천시 영남대로 1949-2054-435-84882021-10-21
67영남환경봉산면봉산면김천시 영남대로 1949-2054-435-84882021-10-21
78영남환경대항면대항면김천시 영남대로 1949-2054-435-84882021-10-21
89영남환경감천면감천면김천시 영남대로 1949-2054-435-84882021-10-21
910영남환경조마면조마면김천시 영남대로 1949-2054-435-84882021-10-21
연번업체명담당구역(행정동)담당구역(법정동)주소지전화번호데이터기준일자
1213영남환경부항면부항면김천시 영남대로 1949-2054-435-84882021-10-21
1314영남환경대덕면대덕면김천시 영남대로 1949-2054-435-84882021-10-21
1415영남환경증산면증산면김천시 영남대로 1949-2054-435-84882021-10-21
1516영남환경자산동자산동김천시 영남대로 1949-2054-435-84882021-10-21
1617영남환경평화남산동평화남산동김천시 영남대로 1949-2054-435-84882021-10-21
1718영남환경양금동양금동김천시 영남대로 1949-2054-435-84882021-10-21
1819영남환경대신동대신동김천시 영남대로 1949-2054-435-84882021-10-21
1920영남환경대곡동대곡동김천시 영남대로 1949-2054-435-84882021-10-21
2021영남환경지좌동지좌동김천시 영남대로 1949-2054-435-84882021-10-21
2122영남환경율곡동율곡동김천시 영남대로 1949-2054-435-84882021-10-21