Overview

Dataset statistics

Number of variables17
Number of observations32
Missing cells83
Missing cells (%)15.3%
Duplicate rows14
Duplicate rows (%)43.8%
Total size in memory4.4 KiB
Average record size in memory140.1 B

Variable types

Text2
Unsupported15

Dataset

Description하수처리장에서 사용되는 전력을 저감하기 위해 설치된 2018~2022년간 재생에너지의 발전량, 전력비 절감액 등 현황 자료 입니다.
URLhttps://www.data.go.kr/data/15112865/fileData.do

Alerts

Unnamed: 16 has constant value ""Constant
Dataset has 14 (43.8%) duplicate rowsDuplicates
소수력 has 3 (9.4%) missing valuesMissing
Unnamed: 1 has 2 (6.2%) missing valuesMissing
Unnamed: 2 has 4 (12.5%) missing valuesMissing
Unnamed: 3 has 4 (12.5%) missing valuesMissing
Unnamed: 4 has 2 (6.2%) missing valuesMissing
Unnamed: 5 has 4 (12.5%) missing valuesMissing
Unnamed: 6 has 4 (12.5%) missing valuesMissing
Unnamed: 7 has 2 (6.2%) missing valuesMissing
Unnamed: 8 has 4 (12.5%) missing valuesMissing
Unnamed: 9 has 4 (12.5%) missing valuesMissing
Unnamed: 10 has 2 (6.2%) missing valuesMissing
Unnamed: 11 has 4 (12.5%) missing valuesMissing
Unnamed: 12 has 4 (12.5%) missing valuesMissing
Unnamed: 13 has 2 (6.2%) missing valuesMissing
Unnamed: 14 has 4 (12.5%) missing valuesMissing
Unnamed: 15 has 4 (12.5%) missing valuesMissing
Unnamed: 16 has 30 (93.8%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 22:53:41.119558
Analysis finished2023-12-12 22:53:42.092106
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소수력
Text

MISSING 

Distinct16
Distinct (%)55.2%
Missing3
Missing (%)9.4%
Memory size388.0 B
2023-12-13T07:53:42.194780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length2
Mean length2.9310345
Min length1

Characters and Unicode

Total characters85
Distinct characters24
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)10.3%

Sample

1st row구   분 (100kW)
2nd row
3rd row1월
4th row2월
5th row3월
ValueCountFrequency (%)
6월 2
 
6.1%
4월 2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1월 2
 
6.1%
2월 2
 
6.1%
12월 2
 
6.1%
3월 2
 
6.1%
5월 2
 
6.1%
Other values (8) 13
39.4%
2023-12-13T07:53:42.447169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
28.2%
1 12
14.1%
0 5
 
5.9%
2 4
 
4.7%
  4
 
4.7%
8 3
 
3.5%
6 2
 
2.4%
2
 
2.4%
W 2
 
2.4%
k 2
 
2.4%
Other values (14) 25
29.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36
42.4%
Other Letter 33
38.8%
Space Separator 6
 
7.1%
Uppercase Letter 2
 
2.4%
Lowercase Letter 2
 
2.4%
Open Punctuation 2
 
2.4%
Control 2
 
2.4%
Close Punctuation 2
 
2.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 12
33.3%
0 5
13.9%
2 4
 
11.1%
8 3
 
8.3%
6 2
 
5.6%
3 2
 
5.6%
4 2
 
5.6%
5 2
 
5.6%
7 2
 
5.6%
9 2
 
5.6%
Other Letter
ValueCountFrequency (%)
24
72.7%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Space Separator
ValueCountFrequency (%)
  4
66.7%
2
33.3%
Uppercase Letter
ValueCountFrequency (%)
W 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
k 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 48
56.5%
Hangul 33
38.8%
Latin 4
 
4.7%

Most frequent character per script

Common
ValueCountFrequency (%)
1 12
25.0%
0 5
10.4%
2 4
 
8.3%
  4
 
8.3%
8 3
 
6.2%
6 2
 
4.2%
( 2
 
4.2%
2
 
4.2%
2
 
4.2%
) 2
 
4.2%
Other values (5) 10
20.8%
Hangul
ValueCountFrequency (%)
24
72.7%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Latin
ValueCountFrequency (%)
W 2
50.0%
k 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48
56.5%
Hangul 33
38.8%
None 4
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
72.7%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
ASCII
ValueCountFrequency (%)
1 12
25.0%
0 5
10.4%
2 4
 
8.3%
8 3
 
6.2%
6 2
 
4.2%
W 2
 
4.2%
k 2
 
4.2%
( 2
 
4.2%
2
 
4.2%
2
 
4.2%
Other values (6) 12
25.0%
None
ValueCountFrequency (%)
  4
100.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.2%
Memory size388.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.2%
Memory size388.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.2%
Memory size388.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.2%
Memory size388.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.2%
Memory size388.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.5%
Memory size388.0 B

Unnamed: 16
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing30
Missing (%)93.8%
Memory size388.0 B
2023-12-13T07:53:42.523834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비고
2nd row비고
ValueCountFrequency (%)
비고 2
100.0%
2023-12-13T07:53:42.717019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Missing values

2023-12-13T07:53:41.256565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:53:41.455698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:53:41.924601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소수력Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16
0구   분 (100kW)2018년도NaNNaN2019년도NaNNaN2020년도NaNNaN2021년도NaNNaN2022년도NaNNaN비고
1<NA>발전량\n[kwh]전력비\n절감액[원]석유환산톤\n(toe/0.215)발전량\n[kwh]전력비\n절감액[원]석유환산톤\n(toe/0.213)발전량\n[kwh]전력비\n절감액[원]석유환산톤\n(toe/0.213)발전량\n[kwh]전력비\n절감액[원]석유환산톤\n(toe/0.215)발전량\n[kwh]전력비\n절감액[원]석유환산톤\n(toe/0.213)<NA>
23453224316525074.244232447423059275052.1300462410923013650051.352596256514.18181832064272.72727354.6375211216111520137525.903143<NA>
31월2924136551256.2868152399629995005.1111481554219427503.3104462009325116254.279809000<NA>
42월2761234515005.936582083526043754.4378551531419142503.2618822083526043754.437855000<NA>
53월3353241915007.209382181427267504.6463821893323666254.0327292081426017504.433382345431250.073485<NA>
64월3091538643756.6467252088126101254.4476531933624170004.1185682221627770004.732008000<NA>
75월3248740608756.9847052223327791254.7356292091926148754.4557471759221990003.7470961851823147503.944334<NA>
86월2998037475006.44572124326553754.5247592262428280004.8189122339229240004.98249634364295000.731868<NA>
97월3206540081256.8939752078125976254.4263531899123738754.0450832416930211255.1479972038325478754.341579<NA>
소수력Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16
223월2018125226254.3389152016425205004.2949321749421867503.7262221755421942503.73900274989372501.597074<NA>
234월2260228252504.859431893223665004.0325162106926336254.4876972235327941254.7611892878435980006.130992<NA>
245월2152826910004.628522350929386255.0074172188527356254.6615051973424667504.2033422265828322504.826154<NA>
256월2027625345004.359341869323366253.9816092211627645004.7107082272428405004.8402121263115788752.690403<NA>
267월1923724046254.1359551639120488753.4912831318116476252.8075531821722771253.8802212199727496254.685361<NA>
278월1800022500003.871840123001253.9194131527219090003.2529361216215202502.5905061320616507502.812878<NA>
289월1750621882503.763791583119788753.3720031735921698753.6974671020912761252.1745171400617507502.983278<NA>
2910월1966732730004.2284051866223327503.9750061880823510004.00610468328540001.4552161343316791252.861229<NA>
3011월1778022225003.82271540019250003.28021451318141253.0912691279515993752.72533558767345001.251588<NA>
3112월1442518031253.1013751629020362503.469771326116576252.82459314214.417768003.0276671500218752503.195426<NA>

Duplicate rows

Most frequently occurring

소수력Unnamed: 16# duplicates
13<NA><NA>3
010월<NA>2
111월<NA>2
212월<NA>2
31월<NA>2
42월<NA>2
53월<NA>2
64월<NA>2
75월<NA>2
86월<NA>2