Overview

Dataset statistics

Number of variables9
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory78.5 B

Variable types

Categorical8
Numeric1

Dataset

Description한국지역난방공사 공사비부담금 단가에 대한 정보로 온/냉수구분별, 계약종별, 용량범위별 단가를 제공하고 있습니다.
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15090314/fileData.do

Alerts

적용단위(용량) is highly overall correlated with 단가 and 7 other fieldsHigh correlation
계약종별(1) is highly overall correlated with 단가 and 5 other fieldsHigh correlation
계약종별(3) is highly overall correlated with 단가 and 2 other fieldsHigh correlation
구분 is highly overall correlated with 단가 and 5 other fieldsHigh correlation
적용단위(기준) is highly overall correlated with 단가 and 5 other fieldsHigh correlation
기준단위 is highly overall correlated with 단가 and 5 other fieldsHigh correlation
계약종별(2) is highly overall correlated with 구분 and 5 other fieldsHigh correlation
용량범위 is highly overall correlated with 단가 and 5 other fieldsHigh correlation
단가 is highly overall correlated with 구분 and 6 other fieldsHigh correlation
기준단위 is highly imbalanced (58.6%)Imbalance
적용단위(기준) is highly imbalanced (58.6%)Imbalance
단가 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:55:09.318630
Analysis finished2023-12-12 22:55:10.007143
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
온수
20 
냉수

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row온수
2nd row온수
3rd row온수
4th row온수
5th row온수

Common Values

ValueCountFrequency (%)
온수 20
83.3%
냉수 4
 
16.7%

Length

2023-12-13T07:55:10.329430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:10.423571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
온수 20
83.3%
냉수 4
 
16.7%

계약종별(1)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size324.0 B
공공용
10 
업무용
전체
주택용

Length

Max length3
Median length3
Mean length2.8333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주택용
2nd row주택용
3rd row업무용
4th row업무용
5th row업무용

Common Values

ValueCountFrequency (%)
공공용 10
41.7%
업무용 8
33.3%
전체 4
 
16.7%
주택용 2
 
8.3%

Length

2023-12-13T07:55:10.512532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:10.592989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공용 10
41.7%
업무용 8
33.3%
전체 4
 
16.7%
주택용 2
 
8.3%

계약종별(2)
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
<NA>
14 
그 이외
학교, 사회복지시설

Length

Max length10
Median length4
Mean length4.5
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 14
58.3%
그 이외 8
33.3%
학교, 사회복지시설 2
 
8.3%

Length

2023-12-13T07:55:10.687070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:10.772349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 14
41.2%
8
23.5%
이외 8
23.5%
학교 2
 
5.9%
사회복지시설 2
 
5.9%

계약종별(3)
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
신축
10 
기존
10 
<NA>

Length

Max length4
Median length2
Mean length2.3333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신축
2nd row기존
3rd row신축
4th row신축
5th row신축

Common Values

ValueCountFrequency (%)
신축 10
41.7%
기존 10
41.7%
<NA> 4
 
16.7%

Length

2023-12-13T07:55:10.877091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:10.989183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신축 10
41.7%
기존 10
41.7%
na 4
 
16.7%

기준단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
연결열부하
22 
계약면적
 
2

Length

Max length5
Median length5
Mean length4.9166667
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row계약면적
2nd row계약면적
3rd row연결열부하
4th row연결열부하
5th row연결열부하

Common Values

ValueCountFrequency (%)
연결열부하 22
91.7%
계약면적 2
 
8.3%

Length

2023-12-13T07:55:11.088487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:11.165011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연결열부하 22
91.7%
계약면적 2
 
8.3%

적용단위(기준)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
Mcal/h
22 
 
2

Length

Max length6
Median length6
Mean length5.5833333
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd rowMcal/h
4th rowMcal/h
5th rowMcal/h

Common Values

ValueCountFrequency (%)
Mcal/h 22
91.7%
2
 
8.3%

Length

2023-12-13T07:55:11.249002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:11.325015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
mcal/h 22
91.7%
2
 
8.3%

용량범위
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
<NA>
0_300
301_1000
1001_3500
3501이상
Other values (4)

Length

Max length9
Median length7
Mean length6.5833333
Min length4

Unique

Unique4 ?
Unique (%)16.7%

Sample

1st row<NA>
2nd row<NA>
3rd row0_300
4th row301_1000
5th row1001_3500

Common Values

ValueCountFrequency (%)
<NA> 4
16.7%
0_300 4
16.7%
301_1000 4
16.7%
1001_3500 4
16.7%
3501이상 4
16.7%
0_1000 1
 
4.2%
1001_2000 1
 
4.2%
2001_3000 1
 
4.2%
3001이상 1
 
4.2%

Length

2023-12-13T07:55:11.409529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:11.504906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 4
16.7%
0_300 4
16.7%
301_1000 4
16.7%
1001_3500 4
16.7%
3501이상 4
16.7%
0_1000 1
 
4.2%
1001_2000 1
 
4.2%
2001_3000 1
 
4.2%
3001이상 1
 
4.2%

적용단위(용량)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
Mcal/h
20 
<NA>

Length

Max length6
Median length6
Mean length5.6666667
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd rowMcal/h
4th rowMcal/h
5th rowMcal/h

Common Values

ValueCountFrequency (%)
Mcal/h 20
83.3%
<NA> 4
 
16.7%

Length

2023-12-13T07:55:11.629863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:55:11.718624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
mcal/h 20
83.3%
na 4
 
16.7%

단가
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122512.5
Minimum7050
Maximum429300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T07:55:11.793342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7050
5-th percentile19779
Q173322.5
median98155
Q3136235
95-th percentile263935
Maximum429300
Range422250
Interquartile range (IQR)62912.5

Descriptive statistics

Standard deviation90090.218
Coefficient of variation (CV)0.73535531
Kurtosis5.0591758
Mean122512.5
Median Absolute Deviation (MAD)28440
Skewness1.949085
Sum2940300
Variance8.1162474 × 109
MonotonicityNot monotonic
2023-12-13T07:55:11.891004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
14040 1
 
4.2%
118220 1
 
4.2%
208600 1
 
4.2%
230700 1
 
4.2%
269800 1
 
4.2%
429300 1
 
4.2%
62370 1
 
4.2%
71140 1
 
4.2%
74050 1
 
4.2%
87060 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
7050 1
4.2%
14040 1
4.2%
52300 1
4.2%
62370 1
4.2%
69300 1
4.2%
71140 1
4.2%
74050 1
4.2%
79050 1
4.2%
82290 1
4.2%
87060 1
4.2%
ValueCountFrequency (%)
429300 1
4.2%
269800 1
4.2%
230700 1
4.2%
208600 1
4.2%
167630 1
4.2%
150860 1
4.2%
131360 1
4.2%
126180 1
4.2%
118220 1
4.2%
113560 1
4.2%

Interactions

2023-12-13T07:55:09.753870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:55:11.959810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분계약종별(1)계약종별(2)계약종별(3)기준단위적용단위(기준)용량범위단가
구분1.0001.000NaNNaN0.0000.0001.0001.000
계약종별(1)1.0001.000NaN0.0001.0001.0000.6110.961
계약종별(2)NaNNaN1.0000.000NaNNaNNaN0.000
계약종별(3)NaN0.0000.0001.0000.0000.0000.0000.923
기준단위0.0001.000NaN0.0001.0000.900NaN1.000
적용단위(기준)0.0001.000NaN0.0000.9001.000NaN1.000
용량범위1.0000.611NaN0.000NaNNaN1.0000.907
단가1.0000.9610.0000.9231.0001.0000.9071.000
2023-12-13T07:55:12.097770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용단위(용량)계약종별(1)계약종별(3)구분적용단위(기준)기준단위계약종별(2)용량범위
적용단위(용량)1.0001.0001.0001.0001.0001.0001.0001.000
계약종별(1)1.0001.0000.0000.9530.9530.9531.0000.383
계약종별(3)1.0000.0001.0001.0000.0000.0000.0000.000
구분1.0000.9531.0001.0000.0000.0001.0000.816
적용단위(기준)1.0000.9530.0000.0001.0000.7121.0001.000
기준단위1.0000.9530.0000.0000.7121.0001.0001.000
계약종별(2)1.0001.0000.0001.0001.0001.0001.0001.000
용량범위1.0000.3830.0000.8161.0001.0001.0001.000
2023-12-13T07:55:12.215158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단가구분계약종별(1)계약종별(2)계약종별(3)기준단위적용단위(기준)용량범위적용단위(용량)
단가1.0000.8530.6610.0000.6440.8530.8530.7351.000
구분0.8531.0000.9531.0001.0000.0000.0000.8161.000
계약종별(1)0.6610.9531.0001.0000.0000.9530.9530.3831.000
계약종별(2)0.0001.0001.0001.0000.0001.0001.0001.0001.000
계약종별(3)0.6441.0000.0000.0001.0000.0000.0000.0001.000
기준단위0.8530.0000.9531.0000.0001.0000.7121.0001.000
적용단위(기준)0.8530.0000.9531.0000.0000.7121.0001.0001.000
용량범위0.7350.8160.3831.0000.0001.0001.0001.0001.000
적용단위(용량)1.0001.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2023-12-13T07:55:09.841957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:55:09.958729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분계약종별(1)계약종별(2)계약종별(3)기준단위적용단위(기준)용량범위적용단위(용량)단가
0온수주택용<NA>신축계약면적<NA><NA>14040
1온수주택용<NA>기존계약면적<NA><NA>7050
2온수업무용<NA>신축연결열부하Mcal/h0_300Mcal/h167630
3온수업무용<NA>신축연결열부하Mcal/h301_1000Mcal/h131360
4온수업무용<NA>신축연결열부하Mcal/h1001_3500Mcal/h126180
5온수업무용<NA>신축연결열부하Mcal/h3501이상Mcal/h110630
6온수업무용<NA>기존연결열부하Mcal/h0_300Mcal/h96740
7온수업무용<NA>기존연결열부하Mcal/h301_1000Mcal/h82290
8온수업무용<NA>기존연결열부하Mcal/h1001_3500Mcal/h79050
9온수업무용<NA>기존연결열부하Mcal/h3501이상Mcal/h69300
구분계약종별(1)계약종별(2)계약종별(3)기준단위적용단위(기준)용량범위적용단위(용량)단가
14온수공공용그 이외신축연결열부하Mcal/h1001_3500Mcal/h113560
15온수공공용그 이외신축연결열부하Mcal/h3501이상Mcal/h99570
16온수공공용그 이외기존연결열부하Mcal/h0_300Mcal/h87060
17온수공공용그 이외기존연결열부하Mcal/h301_1000Mcal/h74050
18온수공공용그 이외기존연결열부하Mcal/h1001_3500Mcal/h71140
19온수공공용그 이외기존연결열부하Mcal/h3501이상Mcal/h62370
20냉수전체<NA><NA>연결열부하Mcal/h0_1000Mcal/h429300
21냉수전체<NA><NA>연결열부하Mcal/h1001_2000Mcal/h269800
22냉수전체<NA><NA>연결열부하Mcal/h2001_3000Mcal/h230700
23냉수전체<NA><NA>연결열부하Mcal/h3001이상Mcal/h208600