Overview

Dataset statistics

Number of variables: 5
Number of observations: 26
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 50.1 B

Variable types

Numeric: 3
Categorical: 2

Dataset

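Description: Data on statutory quality inspections of city gas (total inspections, passed, and failed), with the number of inspections recorded for each quarter from 2015 onward.
Author: Korea Gas Corporation (한국가스공사)
URL: https://www.data.go.kr/data/15066497/fileData.do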

Alerts

불합격 건수 has constant value "0" (Constant)
연도 is highly overall correlated with 검사횟수 and 1 other field (High correlation)
검사횟수 is highly overall correlated with 연도 and 1 other field (High correlation)
합격건수 is highly overall correlated with 연도 and 1 other field (High correlation)

Reproduction

Analysis started: 2023-12-12 14:55:20.443221
Analysis finished: 2023-12-12 14:55:21.697466
Duration: 1.25 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 7
Distinct (%): 26.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2017.7692
Minimum: 2015
Maximum: 2021
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 2015
5-th percentile: 2015
Q1: 2016
Median: 2018
Q3: 2019
95-th percentile: 2020.75
Maximum: 2021
Range: 6
Interquartile range (IQR): 3
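
The quantile figures for 연도 (year) can be checked by hand. The sketch below rebuilds the 26 values from the value counts reported for this variable and applies linear interpolation between neighbouring ranks. The interpolation rule and the helper name `percentile` are assumptions made for illustration; the rule was chosen because it reproduces the reported numbers, not because the report states which method it uses.

```python
# Rebuild the 26 yearly values from the value counts reported for 연도.
year_counts = {2015: 4, 2016: 4, 2017: 4, 2018: 4, 2019: 4, 2020: 4, 2021: 2}
years = sorted(y for y, c in year_counts.items() for _ in range(c))


def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Hypothetical helper: assumes the input is already sorted ascending.
    """
    pos = (len(sorted_values) - 1) * q / 100   # fractional rank, 0-based
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac


q1, median, q3 = (percentile(years, q) for q in (25, 50, 75))
p95 = percentile(years, 95)
iqr = q3 - q1
value_range = years[-1] - years[0]

print(q1, median, q3, p95, iqr, value_range)
```

The 95th percentile falls at fractional rank 23.75, between a 2020 entry and a 2021 entry, which is where the non-integer value 2020.75 comes from.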

Descriptive statistics

Standard deviation: 1.9247377
Coefficient of variation (CV): 0.00095389389
Kurtosis: -1.1655043
Mean: 2017.7692
Median Absolute Deviation (MAD): 2
Skewness: 0.063433589
Sum: 52462
Variance: 3.7046154
Monotonicity: Increasing
Histogram with fixed size bins (bins=7)
Common values

Value  Count  Frequency (%)
2015   4      15.4%
2016   4      15.4%
2017   4      15.4%
2018   4      15.4%
2019   4      15.4%
2020   4      15.4%
2021   2      7.7%

Minimum 10 values

Value  Count  Frequency (%)
2015   4      15.4%
2016   4      15.4%
2017   4      15.4%
2018   4      15.4%
2019   4      15.4%
2020   4      15.4%
2021   2      7.7%

Maximum 10 values

Value  Count  Frequency (%)
2021   2      7.7%
2020   4      15.4%
2019   4      15.4%
2018   4      15.4%
2017   4      15.4%
2016   4      15.4%
2015   4      15.4%
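
The descriptive statistics reported for 연도 (year) can be re-derived from its value counts. This is a minimal sketch using only Python's standard `statistics` module. It assumes the report uses the sample variance (divisor n - 1), an assumption that is consistent with the reported figure of 3.7046154.

```python
import statistics

# Rebuild the 26 yearly values from the value counts reported for 연도.
year_counts = {2015: 4, 2016: 4, 2017: 4, 2018: 4, 2019: 4, 2020: 4, 2021: 2}
years = [y for y, c in year_counts.items() for _ in range(c)]

total = sum(years)
mean = statistics.fmean(years)
variance = statistics.variance(years)   # sample variance, divisor n - 1
std = statistics.stdev(years)           # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
med = statistics.median(years)
mad = statistics.median(abs(y - med) for y in years)

print(total, round(mean, 4), round(variance, 7), round(std, 7), cv, mad)
```

Because the values sit near 2018 while the spread is under two years, the coefficient of variation is tiny (about 0.00095); it mostly reflects the large offset of calendar years rather than low variability in any practical sense.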

분기
Categorical

Distinct: 4
Distinct (%): 15.4%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 2
3rd row: 3
4th row: 4
5th row: 1

Common Values

Value  Count  Frequency (%)
1      7      26.9%
2      7      26.9%
3      6      23.1%
4      6      23.1%
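
The Frequency (%) and Distinct (%) figures for 분기 (quarter) are each a share of the 26 observations. The sketch below recomputes them from the counts in the table; the variable names are illustrative only.

```python
# Counts per quarter, copied from the Common Values table for 분기.
quarter_counts = {"1": 7, "2": 7, "3": 6, "4": 6}

total = sum(quarter_counts.values())  # number of observations

# Share of observations per quarter, rounded to one decimal as in the report.
frequency_pct = {q: round(100 * c / total, 1) for q, c in quarter_counts.items()}

# Distinct (%) is the number of distinct values over the number of observations.
distinct_pct = round(100 * len(quarter_counts) / total, 1)

print(total, frequency_pct, distinct_pct)
```

Quarters 1 and 2 have one more observation than quarters 3 and 4, which matches the 연도 counts: 2021 contributes only two rows.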

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the value counts for 분기 (1: 7, 2: 7, 3: 6, 4: 6).

검사횟수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 21
Distinct (%): 80.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 203
Minimum: 172
Maximum: 275
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 172
5-th percentile: 175
Q1: 182
Median: 190
Q3: 221.75
95-th percentile: 243.75
Maximum: 275
Range: 103
Interquartile range (IQR): 39.75

Descriptive statistics

Standard deviation: 27.136691
Coefficient of variation (CV): 0.13367828
Kurtosis: 0.18050247
Mean: 203
Median Absolute Deviation (MAD): 14
Skewness: 0.92632711
Sum: 5278
Variance: 736.4
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Common values

Value              Count  Frequency (%)
185                3      11.5%
182                2      7.7%
184                2      7.7%
216                2      7.7%
230                1      3.8%
235                1      3.8%
180                1      3.8%
179                1      3.8%
172                1      3.8%
174                1      3.8%
Other values (11)  11     42.3%

Minimum 10 values

Value  Count  Frequency (%)
172    1      3.8%
174    1      3.8%
178    1      3.8%
179    1      3.8%
180    1      3.8%
181    1      3.8%
182    2      7.7%
184    2      7.7%
185    3      11.5%
195    1      3.8%

Maximum 10 values

Value  Count  Frequency (%)
275    1      3.8%
244    1      3.8%
243    1      3.8%
235    1      3.8%
230    1      3.8%
223    1      3.8%
222    1      3.8%
221    1      3.8%
216    2      7.7%
211    1      3.8%

합격건수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 21
Distinct (%): 80.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 203
Minimum: 172
Maximum: 275
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 366.0 B

Quantile statistics

Minimum: 172
5-th percentile: 175
Q1: 182
Median: 190
Q3: 221.75
95-th percentile: 243.75
Maximum: 275
Range: 103
Interquartile range (IQR): 39.75

Descriptive statistics

Standard deviation: 27.136691
Coefficient of variation (CV): 0.13367828
Kurtosis: 0.18050247
Mean: 203
Median Absolute Deviation (MAD): 14
Skewness: 0.92632711
Sum: 5278
Variance: 736.4
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Common values

Value              Count  Frequency (%)
185                3      11.5%
182                2      7.7%
184                2      7.7%
216                2      7.7%
230                1      3.8%
235                1      3.8%
180                1      3.8%
179                1      3.8%
172                1      3.8%
174                1      3.8%
Other values (11)  11     42.3%

Minimum 10 values

Value  Count  Frequency (%)
172    1      3.8%
174    1      3.8%
178    1      3.8%
179    1      3.8%
180    1      3.8%
181    1      3.8%
182    2      7.7%
184    2      7.7%
185    3      11.5%
195    1      3.8%

Maximum 10 values

Value  Count  Frequency (%)
275    1      3.8%
244    1      3.8%
243    1      3.8%
235    1      3.8%
230    1      3.8%
223    1      3.8%
222    1      3.8%
221    1      3.8%
216    2      7.7%
211    1      3.8%

불합격 건수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      26     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the value counts for 불합격 건수 (0: 26).

Interactions

[Figure: pairwise interaction scatter plots of the numeric variables (9 panels)]

Correlations

          연도     분기     검사횟수  합격건수
연도      1.000   0.000   0.726    0.726
분기      0.000   1.000   0.000    0.000
검사횟수  0.726   0.000   1.000    1.000
합격건수  0.726   0.000   1.000    1.000

          연도     검사횟수  합격건수  분기
연도      1.000   -0.590   -0.590   0.000
검사횟수  -0.590  1.000    1.000    0.000
합격건수  -0.590  1.000    1.000    0.000
분기      0.000   0.000    0.000    1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    연도  분기  검사횟수  합격건수  불합격 건수
0   2015  1     230       230       0
1   2015  2     243       243       0
2   2015  3     211       211       0
3   2015  4     195       195       0
4   2016  1     222       222       0
5   2016  2     275       275       0
6   2016  3     196       196       0
7   2016  4     184       184       0
8   2017  1     181       181       0
9   2017  2     223       223       0

Last rows

    연도  분기  검사횟수  합격건수  불합격 건수
16  2019  1     178       178       0
17  2019  2     174       174       0
18  2019  3     172       172       0
19  2019  4     179       179       0
20  2020  1     184       184       0
21  2020  2     180       180       0
22  2020  3     182       182       0
23  2020  4     185       185       0
24  2021  1     185       185       0
25  2021  2     182       182       0
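
The sample rows help explain two of the alerts: 불합격 건수 (failed count) is always 0, so 합격건수 (passed count) equals 검사횟수 (inspection count) in every row, and the two columns therefore correlate perfectly. The sketch below checks this on the 20 rows shown in the sample (rows 10 to 15 are not listed in the report and are left out). The tuple layout and the `pearson` helper are illustrative assumptions, not part of the report.

```python
import math

# (연도, 분기, 검사횟수, 합격건수, 불합격 건수) for the 20 rows shown in the sample.
rows = [
    (2015, 1, 230, 230, 0), (2015, 2, 243, 243, 0), (2015, 3, 211, 211, 0),
    (2015, 4, 195, 195, 0), (2016, 1, 222, 222, 0), (2016, 2, 275, 275, 0),
    (2016, 3, 196, 196, 0), (2016, 4, 184, 184, 0), (2017, 1, 181, 181, 0),
    (2017, 2, 223, 223, 0), (2019, 1, 178, 178, 0), (2019, 2, 174, 174, 0),
    (2019, 3, 172, 172, 0), (2019, 4, 179, 179, 0), (2020, 1, 184, 184, 0),
    (2020, 2, 180, 180, 0), (2020, 3, 182, 182, 0), (2020, 4, 185, 185, 0),
    (2021, 1, 185, 185, 0), (2021, 2, 182, 182, 0),
]

# Rows where inspections != passed + failed (expected to be empty).
mismatches = [row for row in rows if row[2] != row[3] + row[4]]


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


inspections = [row[2] for row in rows]
passed = [row[3] for row in rows]
corr = pearson(inspections, passed)

print(len(mismatches), corr)
```

In practice this means 합격건수 carries no information beyond 검사횟수 for this dataset, and 불합격 건수 carries none at all, so an analysis could reasonably keep only one of the three columns.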