Overview

Dataset statistics

Number of variables5
Number of observations95
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory44.4 B

Variable types

Categorical3
Numeric2

Dataset

Description한국전기안전공사에서 제공하는 최근 (19년 ~ 22년)동안 연료전지의 사용전검사 데이터입니다. 용도구분(자가용, 사업용), 점검용량, 점검건수를 확인하실 수 있습니다.
URLhttps://www.data.go.kr/data/15103225/fileData.do

Alerts

발전기종류 has constant value ""Constant
발전기용량 is highly overall correlated with 용도High correlation
건수 is highly overall correlated with 용도High correlation
용도 is highly overall correlated with 발전기용량 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 20:54:20.180270
Analysis finished2023-12-12 20:54:20.716189
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

Distinct4
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size892.0 B
2019
29 
2021
26 
2020
23 
2022
17 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 29
30.5%
2021 26
27.4%
2020 23
24.2%
2022 17
17.9%

Length

2023-12-13T05:54:20.766114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:54:20.839963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 29
30.5%
2021 26
27.4%
2020 23
24.2%
2022 17
17.9%

용도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
자가용
79 
사업용
16 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자가용
2nd row자가용
3rd row자가용
4th row자가용
5th row자가용

Common Values

ValueCountFrequency (%)
자가용 79
83.2%
사업용 16
 
16.8%

Length

2023-12-13T05:54:20.922077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:54:20.994497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자가용 79
83.2%
사업용 16
 
16.8%

발전기종류
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
연료전지
95 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연료전지
2nd row연료전지
3rd row연료전지
4th row연료전지
5th row연료전지

Common Values

ValueCountFrequency (%)
연료전지 95
100.0%

Length

2023-12-13T05:54:21.069153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:54:21.140587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연료전지 95
100.0%

발전기용량
Real number (ℝ)

HIGH CORRELATION 

Distinct42
Distinct (%)44.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103.95789
Minimum1
Maximum2500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size987.0 B
2023-12-13T05:54:21.214734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17
median16
Q356
95-th percentile440
Maximum2500
Range2499
Interquartile range (IQR)49

Descriptive statistics

Standard deviation298.07231
Coefficient of variation (CV)2.8672408
Kurtosis45.497889
Mean103.95789
Median Absolute Deviation (MAD)11
Skewness6.1268938
Sum9876
Variance88847.105
MonotonicityNot monotonic
2023-12-13T05:54:21.335433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
1 4
 
4.2%
3 4
 
4.2%
440 4
 
4.2%
5 4
 
4.2%
6 4
 
4.2%
300 4
 
4.2%
8 4
 
4.2%
10 4
 
4.2%
105 4
 
4.2%
2 4
 
4.2%
Other values (32) 55
57.9%
ValueCountFrequency (%)
1 4
4.2%
2 4
4.2%
3 4
4.2%
4 2
2.1%
5 4
4.2%
6 4
4.2%
7 3
3.2%
8 4
4.2%
10 4
4.2%
11 3
3.2%
ValueCountFrequency (%)
2500 1
 
1.1%
960 1
 
1.1%
800 1
 
1.1%
540 1
 
1.1%
440 4
4.2%
300 4
4.2%
180 1
 
1.1%
105 4
4.2%
100 2
2.1%
75 1
 
1.1%

건수
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)41.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.957895
Minimum1
Maximum667
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size987.0 B
2023-12-13T05:54:21.433594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median4
Q321.5
95-th percentile381
Maximum667
Range666
Interquartile range (IQR)20.5

Descriptive statistics

Standard deviation126.869
Coefficient of variation (CV)2.3084763
Kurtosis9.0900501
Mean54.957895
Median Absolute Deviation (MAD)3
Skewness3.0240971
Sum5221
Variance16095.743
MonotonicityNot monotonic
2023-12-13T05:54:21.765950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
1 26
27.4%
2 10
 
10.5%
3 9
 
9.5%
4 6
 
6.3%
16 4
 
4.2%
13 3
 
3.2%
10 2
 
2.1%
7 2
 
2.1%
8 2
 
2.1%
6 2
 
2.1%
Other values (29) 29
30.5%
ValueCountFrequency (%)
1 26
27.4%
2 10
 
10.5%
3 9
 
9.5%
4 6
 
6.3%
5 1
 
1.1%
6 2
 
2.1%
7 2
 
2.1%
8 2
 
2.1%
9 1
 
1.1%
10 2
 
2.1%
ValueCountFrequency (%)
667 1
1.1%
552 1
1.1%
440 1
1.1%
427 1
1.1%
416 1
1.1%
366 1
1.1%
329 1
1.1%
263 1
1.1%
243 1
1.1%
180 1
1.1%

Interactions

2023-12-13T05:54:20.433232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:54:20.298284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:54:20.504019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:54:20.362128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:54:21.839923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도용도발전기용량건수
연도1.0000.3760.2070.000
용도0.3761.0000.7030.550
발전기용량0.2070.7031.0000.549
건수0.0000.5500.5491.000
2023-12-13T05:54:21.907853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도연도
용도1.0000.248
연도0.2481.000
2023-12-13T05:54:21.973422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전기용량건수연도용도
발전기용량1.000-0.1760.1680.824
건수-0.1761.0000.0000.531
연도0.1680.0001.0000.248
용도0.8240.5310.2481.000

Missing values

2023-12-13T05:54:20.614088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:54:20.687131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도용도발전기종류발전기용량건수
02019자가용연료전지1130
12019자가용연료전지26
22019자가용연료전지310
32019자가용연료전지41
42019자가용연료전지541
52019자가용연료전지637
62019자가용연료전지77
72019자가용연료전지81
82019자가용연료전지10115
92019자가용연료전지113
연도용도발전기종류발전기용량건수
852022자가용연료전지153
862022자가용연료전지206
872022자가용연료전지301
882022사업용연료전지440263
892022사업용연료전지300440
902022사업용연료전지9601
912022사업용연료전지8001
922022사업용연료전지1053
932022사업용연료전지25002
942022사업용연료전지5401