Overview

Dataset statistics

Number of variables7
Number of observations37
Missing cells26
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory60.6 B

Variable types

Numeric1
Text3
Categorical3

Dataset

Description한국중부발전(주)의 공공데이터 개방목록 현황이며, 항목명은 "데이터번호", "목록명", "유형", "API명", "파일데이터", "파일데이터 제공방법", "파일형식"으로 이루어져 있습니다.
URLhttps://www.data.go.kr/data/15117435/fileData.do

Alerts

파일데이터 제공방법 is highly overall correlated with 데이터 번호 and 2 other fieldsHigh correlation
파일형식 is highly overall correlated with 데이터 번호 and 2 other fieldsHigh correlation
유형 is highly overall correlated with 데이터 번호 and 2 other fieldsHigh correlation
데이터 번호 is highly overall correlated with 유형 and 2 other fieldsHigh correlation
API명 has 16 (43.2%) missing valuesMissing
파일데이터명 has 10 (27.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:15:29.298141
Analysis finished2023-12-12 03:15:30.381921
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

데이터 번호
Real number (ℝ)

HIGH CORRELATION 

Distinct34
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.945946
Minimum1
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-12T12:15:30.501037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.8
Q19
median16
Q325
95-th percentile32.2
Maximum34
Range33
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.7437899
Coefficient of variation (CV)0.57499239
Kurtosis-1.1540281
Mean16.945946
Median Absolute Deviation (MAD)8
Skewness0.14018513
Sum627
Variance94.941441
MonotonicityIncreasing
2023-12-12T12:15:30.680405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
9 3
 
8.1%
14 2
 
5.4%
1 1
 
2.7%
27 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
28 1
 
2.7%
Other values (24) 24
64.9%
ValueCountFrequency (%)
1 1
 
2.7%
2 1
 
2.7%
3 1
 
2.7%
4 1
 
2.7%
5 1
 
2.7%
6 1
 
2.7%
7 1
 
2.7%
8 1
 
2.7%
9 3
8.1%
10 1
 
2.7%
ValueCountFrequency (%)
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%
27 1
2.7%
26 1
2.7%
25 1
2.7%
Distinct33
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T12:15:30.986391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length11.72973
Min length5

Characters and Unicode

Total characters434
Distinct characters102
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)81.1%

Sample

1st row발전용수 생산 및 사용량
2nd row중부발전 산업재산권 보유현황
3rd row발전소 온배수 정보
4th row중부발전 발전설비 현황
5th row신재생에너지 설비 현황
ValueCountFrequency (%)
정보 10
 
10.5%
중부발전 9
 
9.5%
발전소 4
 
4.2%
신재생에너지 4
 
4.2%
서비스 3
 
3.2%
입찰정보 3
 
3.2%
현황 3
 
3.2%
폐기물처리정보 2
 
2.1%
조회서비스 2
 
2.1%
정보화사업 2
 
2.1%
Other values (49) 53
55.8%
2023-12-12T12:15:31.442235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
13.4%
27
 
6.2%
27
 
6.2%
22
 
5.1%
21
 
4.8%
12
 
2.8%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
Other values (92) 229
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 371
85.5%
Space Separator 58
 
13.4%
Uppercase Letter 3
 
0.7%
Close Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
7.3%
27
 
7.3%
22
 
5.9%
21
 
5.7%
12
 
3.2%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
9
 
2.4%
Other values (86) 215
58.0%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
C 1
33.3%
R 1
33.3%
Space Separator
ValueCountFrequency (%)
58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 371
85.5%
Common 60
 
13.8%
Latin 3
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
7.3%
27
 
7.3%
22
 
5.9%
21
 
5.7%
12
 
3.2%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
9
 
2.4%
Other values (86) 215
58.0%
Common
ValueCountFrequency (%)
58
96.7%
) 1
 
1.7%
( 1
 
1.7%
Latin
ValueCountFrequency (%)
E 1
33.3%
C 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 371
85.5%
ASCII 63
 
14.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
58
92.1%
E 1
 
1.6%
) 1
 
1.6%
C 1
 
1.6%
R 1
 
1.6%
( 1
 
1.6%
Hangul
ValueCountFrequency (%)
27
 
7.3%
27
 
7.3%
22
 
5.9%
21
 
5.7%
12
 
3.2%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
9
 
2.4%
Other values (86) 215
58.0%

유형
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size428.0 B
F
16 
F,A
A
<NA>

Length

Max length4
Median length1
Mean length1.7297297
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF,A
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F 16
43.2%
F,A 9
24.3%
A 9
24.3%
<NA> 3
 
8.1%

Length

2023-12-12T12:15:31.620586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:15:31.738489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 16
43.2%
f,a 9
24.3%
a 9
24.3%
na 3
 
8.1%

API명
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing16
Missing (%)43.2%
Memory size428.0 B
2023-12-12T12:15:31.994287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length13.190476
Min length6

Characters and Unicode

Total characters277
Distinct characters78
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row중부발전 발전소 온배수 배출량
2nd row중부발전 신재생에너지 건설현황
3rd row중부발전 REC 현물시장 거래현황
4th row입찰계획정보 다운로드 서비스
5th row자재입찰정보
ValueCountFrequency (%)
서비스 7
 
11.9%
중부발전 6
 
10.2%
발전소 4
 
6.8%
다운로드 4
 
6.8%
조회서비스 2
 
3.4%
정보 2
 
3.4%
조회 2
 
3.4%
건설현황 2
 
3.4%
연료소비실적 1
 
1.7%
주변 1
 
1.7%
Other values (28) 28
47.5%
2023-12-12T12:15:32.476789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
13.7%
14
 
5.1%
14
 
5.1%
13
 
4.7%
12
 
4.3%
11
 
4.0%
9
 
3.2%
9
 
3.2%
7
 
2.5%
6
 
2.2%
Other values (68) 144
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 236
85.2%
Space Separator 38
 
13.7%
Uppercase Letter 3
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
5.9%
14
 
5.9%
13
 
5.5%
12
 
5.1%
11
 
4.7%
9
 
3.8%
9
 
3.8%
7
 
3.0%
6
 
2.5%
6
 
2.5%
Other values (64) 135
57.2%
Uppercase Letter
ValueCountFrequency (%)
R 1
33.3%
E 1
33.3%
C 1
33.3%
Space Separator
ValueCountFrequency (%)
38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 236
85.2%
Common 38
 
13.7%
Latin 3
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
5.9%
14
 
5.9%
13
 
5.5%
12
 
5.1%
11
 
4.7%
9
 
3.8%
9
 
3.8%
7
 
3.0%
6
 
2.5%
6
 
2.5%
Other values (64) 135
57.2%
Latin
ValueCountFrequency (%)
R 1
33.3%
E 1
33.3%
C 1
33.3%
Common
ValueCountFrequency (%)
38
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 236
85.2%
ASCII 41
 
14.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
92.7%
R 1
 
2.4%
E 1
 
2.4%
C 1
 
2.4%
Hangul
ValueCountFrequency (%)
14
 
5.9%
14
 
5.9%
13
 
5.5%
12
 
5.1%
11
 
4.7%
9
 
3.8%
9
 
3.8%
7
 
3.0%
6
 
2.5%
6
 
2.5%
Other values (64) 135
57.2%

파일데이터명
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing10
Missing (%)27.0%
Memory size428.0 B
2023-12-12T12:15:32.754709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length24
Mean length18.555556
Min length1

Characters and Unicode

Total characters501
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row발전용수 취수량 및 사용량(2018)
2nd row중부발전 산업재산권 보유현황(2019)
3rd row중부발전온배수 배출량 정보(2018)
4th row중부발전 발전설비 현황(2019.08.22)
5th row신재생에너지 설비 정보(2018)
ValueCountFrequency (%)
중부발전 7
 
9.1%
정보 3
 
3.9%
3
 
3.9%
정보(2018 3
 
3.9%
신재생에너지 3
 
3.9%
배출량 2
 
2.6%
발전설비 2
 
2.6%
정보화사업 2
 
2.6%
발전용수 1
 
1.3%
시설물 1
 
1.3%
Other values (50) 50
64.9%
2023-12-12T12:15:33.219509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
10.4%
0 28
 
5.6%
1 24
 
4.8%
2 21
 
4.2%
18
 
3.6%
18
 
3.6%
17
 
3.4%
) 17
 
3.4%
( 17
 
3.4%
15
 
3.0%
Other values (110) 274
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 300
59.9%
Decimal Number 103
 
20.6%
Space Separator 52
 
10.4%
Close Punctuation 17
 
3.4%
Open Punctuation 17
 
3.4%
Other Punctuation 7
 
1.4%
Uppercase Letter 3
 
0.6%
Dash Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
6.0%
18
 
6.0%
17
 
5.7%
15
 
5.0%
12
 
4.0%
11
 
3.7%
8
 
2.7%
8
 
2.7%
7
 
2.3%
7
 
2.3%
Other values (95) 179
59.7%
Decimal Number
ValueCountFrequency (%)
0 28
27.2%
1 24
23.3%
2 21
20.4%
8 15
14.6%
7 6
 
5.8%
9 6
 
5.8%
3 3
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
E 1
33.3%
R 1
33.3%
Space Separator
ValueCountFrequency (%)
52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 300
59.9%
Common 198
39.5%
Latin 3
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
6.0%
18
 
6.0%
17
 
5.7%
15
 
5.0%
12
 
4.0%
11
 
3.7%
8
 
2.7%
8
 
2.7%
7
 
2.3%
7
 
2.3%
Other values (95) 179
59.7%
Common
ValueCountFrequency (%)
52
26.3%
0 28
14.1%
1 24
12.1%
2 21
10.6%
) 17
 
8.6%
( 17
 
8.6%
8 15
 
7.6%
. 7
 
3.5%
7 6
 
3.0%
9 6
 
3.0%
Other values (2) 5
 
2.5%
Latin
ValueCountFrequency (%)
C 1
33.3%
E 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 300
59.9%
ASCII 201
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52
25.9%
0 28
13.9%
1 24
11.9%
2 21
10.4%
) 17
 
8.5%
( 17
 
8.5%
8 15
 
7.5%
. 7
 
3.5%
7 6
 
3.0%
9 6
 
3.0%
Other values (5) 8
 
4.0%
Hangul
ValueCountFrequency (%)
18
 
6.0%
18
 
6.0%
17
 
5.7%
15
 
5.0%
12
 
4.0%
11
 
3.7%
8
 
2.7%
8
 
2.7%
7
 
2.3%
7
 
2.3%
Other values (95) 179
59.7%

파일데이터 제공방법
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size428.0 B
공공데이터 포털 업로드
21 
<NA>
16 

Length

Max length12
Median length12
Mean length8.5405405
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공데이터 포털 업로드
2nd row공공데이터 포털 업로드
3rd row공공데이터 포털 업로드
4th row공공데이터 포털 업로드
5th row공공데이터 포털 업로드

Common Values

ValueCountFrequency (%)
공공데이터 포털 업로드 21
56.8%
<NA> 16
43.2%

Length

2023-12-12T12:15:33.415367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:15:33.579412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공데이터 21
26.6%
포털 21
26.6%
업로드 21
26.6%
na 16
20.3%

파일형식
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size428.0 B
CSV
26 
<NA>
11 

Length

Max length4
Median length3
Mean length3.2972973
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCSV
2nd rowCSV
3rd rowCSV
4th rowCSV
5th rowCSV

Common Values

ValueCountFrequency (%)
CSV 26
70.3%
<NA> 11
29.7%

Length

2023-12-12T12:15:33.737633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:15:33.883579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
csv 26
70.3%
na 11
29.7%

Interactions

2023-12-12T12:15:29.729831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:15:34.003159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터 번호목록명유형API명파일데이터명
데이터 번호1.0000.9850.7571.0001.000
목록명0.9851.0000.7711.0001.000
유형0.7570.7711.0001.0001.000
API명1.0001.0001.0001.0001.000
파일데이터명1.0001.0001.0001.0001.000
2023-12-12T12:15:34.135408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일데이터 제공방법파일형식유형
파일데이터 제공방법1.0001.0001.000
파일형식1.0001.0001.000
유형1.0001.0001.000
2023-12-12T12:15:34.270301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터 번호유형파일데이터 제공방법파일형식
데이터 번호1.0000.5501.0001.000
유형0.5501.0001.0001.000
파일데이터 제공방법1.0001.0001.0001.000
파일형식1.0001.0001.0001.000

Missing values

2023-12-12T12:15:29.909509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:15:30.120495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:15:30.280895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

데이터 번호목록명유형API명파일데이터명파일데이터 제공방법파일형식
01발전용수 생산 및 사용량F<NA>발전용수 취수량 및 사용량(2018)공공데이터 포털 업로드CSV
12중부발전 산업재산권 보유현황F<NA>중부발전 산업재산권 보유현황(2019)공공데이터 포털 업로드CSV
23발전소 온배수 정보F,A중부발전 발전소 온배수 배출량중부발전온배수 배출량 정보(2018)공공데이터 포털 업로드CSV
34중부발전 발전설비 현황F<NA>중부발전 발전설비 현황(2019.08.22)공공데이터 포털 업로드CSV
45신재생에너지 설비 현황F<NA>신재생에너지 설비 정보(2018)공공데이터 포털 업로드CSV
56신재생에너지 건설 및 개발 정보F,A중부발전 신재생에너지 건설현황신재생에너지 건설 및 개발 정보(2019.08.31)공공데이터 포털 업로드CSV
67신재생에너지 (REC) 기준가격 정보F,A중부발전 REC 현물시장 거래현황공급인증서(REC) 기준가격 정보(2018)공공데이터 포털 업로드CSV
78탈황석고 처리 정보F<NA>중부발전 탈황석고 발생량 및 판매량(2018)공공데이터 포털 업로드CSV
89중부발전 입찰정보F,A입찰계획정보 다운로드 서비스중부발전 입찰정보(2018)공공데이터 포털 업로드CSV
99중부발전 입찰정보<NA>자재입찰정보<NA><NA><NA>
데이터 번호목록명유형API명파일데이터명파일데이터 제공방법파일형식
2725신재생발전설비 발전정보A신재생발전설비 발전정보<NA><NA><NA>
2826발전소 주변 기상정보A발전소 주변 기상정보<NA><NA><NA>
2927정보화사업 발주예정정보F<NA>정보화사업 발주예정정보공공데이터 포털 업로드CSV
3028정보화사업 수요예보정보F<NA>정보화사업 수요예보정보공공데이터 포털 업로드CSV
3129시설물 개방 현황 정보F<NA>시설물 개방 현황 정보공공데이터 포털 업로드CSV
3230한국중부발전 지진가속계측기 설치현황F<NA>한국중부발전 지진가속계측기 설치현황<NA>CSV
3331신재생에너지 운영사업현황F<NA>신재생에너지 운영사업현황<NA>CSV
3432중소기업제품 구매실적 정보F<NA>중소기업제품 구매실적 정보<NA>CSV
3533국내외운영 신사업현황F<NA>국내외운영 신사업현황<NA>CSV
3634사업소별 네트워크 트래픽F<NA>사업소별 네트워크 트래픽<NA>CSV