Overview

Dataset statistics

Number of variables6
Number of observations144
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory52.9 B

Variable types

Numeric4
Categorical2

Dataset

Description유해화학물질 취급시설 검사 시설 타입 및 공량 수수료 (시작용량, 끝용량 ,시설용량보정계수 ,검사방법아이디, 시설타입아이디, 시설공량)시작용량 끝용량 범위에 따른 시설별 공량 산정
Author한국환경공단
URLhttps://www.data.go.kr/data/15087754/fileData.do

Alerts

시작용량 is highly overall correlated with 끝용량 and 1 other fieldsHigh correlation
끝용량 is highly overall correlated with 시작용량 and 1 other fieldsHigh correlation
시설용량보정계수 is highly overall correlated with 시작용량 and 1 other fieldsHigh correlation
시설공량 is highly overall correlated with 검사방법아이디High correlation
검사방법아이디 is highly overall correlated with 시설공량High correlation
시작용량 has 18 (12.5%) zerosZeros

Reproduction

Analysis started2024-03-23 06:50:05.045697
Analysis finished2024-03-23 06:50:11.944810
Duration6.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시작용량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean168981.77
Minimum0
Maximum2400000
Zeros18
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-23T06:50:12.274224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q120
median500
Q350000
95-th percentile1000000
Maximum2400000
Range2400000
Interquartile range (IQR)49980

Descriptive statistics

Standard deviation511537.11
Coefficient of variation (CV)3.0271733
Kurtosis13.085697
Mean168981.77
Median Absolute Deviation (MAD)500
Skewness3.6914914
Sum24333375
Variance2.6167021 × 1011
MonotonicityNot monotonic
2024-03-23T06:50:12.980945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 18
 
12.5%
50000 12
 
8.3%
10000 9
 
6.2%
1000 9
 
6.2%
5 9
 
6.2%
120000 9
 
6.2%
5000 9
 
6.2%
500 9
 
6.2%
50 9
 
6.2%
10 6
 
4.2%
Other values (8) 45
31.2%
ValueCountFrequency (%)
0 18
12.5%
5 9
6.2%
10 6
 
4.2%
20 6
 
4.2%
50 9
6.2%
100 6
 
4.2%
200 6
 
4.2%
400 6
 
4.2%
500 9
6.2%
1000 9
6.2%
ValueCountFrequency (%)
2400000 6
4.2%
1000000 6
4.2%
500000 3
 
2.1%
120000 9
6.2%
100000 6
4.2%
50000 12
8.3%
10000 9
6.2%
5000 9
6.2%
1000 9
6.2%
500 9
6.2%

끝용량
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.250169 × 109
Minimum5
Maximum1 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-23T06:50:13.700293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile5
Q1175
median5000
Q3120000
95-th percentile1 × 1010
Maximum1 × 1010
Range1 × 1010
Interquartile range (IQR)119825

Descriptive statistics

Standard deviation3.3186685 × 109
Coefficient of variation (CV)2.654576
Kurtosis3.2976295
Mean1.250169 × 109
Median Absolute Deviation (MAD)4995
Skewness2.2917283
Sum1.8002433 × 1011
Variance1.1013561 × 1019
MonotonicityNot monotonic
2024-03-23T06:50:14.379620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
9999999999 18
 
12.5%
50000 12
 
8.3%
5 9
 
6.2%
10000 9
 
6.2%
1000 9
 
6.2%
50 9
 
6.2%
120000 9
 
6.2%
5000 9
 
6.2%
500 9
 
6.2%
10 6
 
4.2%
Other values (8) 45
31.2%
ValueCountFrequency (%)
5 9
6.2%
10 6
4.2%
20 6
4.2%
50 9
6.2%
100 6
4.2%
200 6
4.2%
400 6
4.2%
500 9
6.2%
1000 9
6.2%
5000 9
6.2%
ValueCountFrequency (%)
9999999999 18
12.5%
2400000 6
 
4.2%
1000000 6
 
4.2%
500000 3
 
2.1%
120000 9
6.2%
100000 6
 
4.2%
50000 12
8.3%
10000 9
6.2%
5000 9
6.2%
1000 9
6.2%

시설용량보정계수
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.4166667
Minimum0.6
Maximum2.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-23T06:50:15.029969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.6
5-th percentile0.6
Q11
median1.2
Q31.6
95-th percentile2.8
Maximum2.8
Range2.2
Interquartile range (IQR)0.6

Descriptive statistics

Standard deviation0.64688603
Coefficient of variation (CV)0.45662543
Kurtosis-0.14017269
Mean1.4166667
Median Absolute Deviation (MAD)0.4
Skewness0.88136096
Sum204
Variance0.41846154
MonotonicityNot monotonic
2024-03-23T06:50:15.523441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1.0 18
12.5%
1.2 18
12.5%
1.6 18
12.5%
0.6 15
10.4%
0.8 15
10.4%
1.4 15
10.4%
2.2 15
10.4%
2.8 15
10.4%
1.1 12
8.3%
1.8 3
 
2.1%
ValueCountFrequency (%)
0.6 15
10.4%
0.8 15
10.4%
1.0 18
12.5%
1.1 12
8.3%
1.2 18
12.5%
1.4 15
10.4%
1.6 18
12.5%
1.8 3
 
2.1%
2.2 15
10.4%
2.8 15
10.4%
ValueCountFrequency (%)
2.8 15
10.4%
2.2 15
10.4%
1.8 3
 
2.1%
1.6 18
12.5%
1.4 15
10.4%
1.2 18
12.5%
1.1 12
8.3%
1.0 18
12.5%
0.8 15
10.4%
0.6 15
10.4%

검사방법아이디
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
유해법
48 
화관법정기수시
48 
화관법설치
48 

Length

Max length7
Median length5
Mean length5
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유해법
2nd row유해법
3rd row유해법
4th row유해법
5th row유해법

Common Values

ValueCountFrequency (%)
유해법 48
33.3%
화관법정기수시 48
33.3%
화관법설치 48
33.3%

Length

2024-03-23T06:50:16.683694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:50:17.230344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유해법 48
33.3%
화관법정기수시 48
33.3%
화관법설치 48
33.3%
Distinct6
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
실내저장시설
27 
실외저장시설
27 
실내보관시설
27 
실외보관시설
27 
제조사용시설
21 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제조사용시설
2nd row제조사용시설
3rd row제조사용시설
4th row제조사용시설
5th row제조사용시설

Common Values

ValueCountFrequency (%)
실내저장시설 27
18.8%
실외저장시설 27
18.8%
실내보관시설 27
18.8%
실외보관시설 27
18.8%
제조사용시설 21
14.6%
지하저장시설 15
10.4%

Length

2024-03-23T06:50:17.869915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:50:18.376409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
실내저장시설 27
18.8%
실외저장시설 27
18.8%
실내보관시설 27
18.8%
실외보관시설 27
18.8%
제조사용시설 21
14.6%
지하저장시설 15
10.4%

시설공량
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.69055764
Minimum0.1229
Maximum2.3333
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-23T06:50:18.866762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.1229
5-th percentile0.1708
Q10.2854
median0.4896
Q30.9896
95-th percentile1.375
Maximum2.3333
Range2.2104
Interquartile range (IQR)0.7042

Descriptive statistics

Standard deviation0.5312087
Coefficient of variation (CV)0.769246
Kurtosis2.1647994
Mean0.69055764
Median Absolute Deviation (MAD)0.2104
Skewness1.5480374
Sum99.4403
Variance0.28218268
MonotonicityNot monotonic
2024-03-23T06:50:19.512289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0.2792 18
12.5%
0.2854 18
12.5%
0.4896 18
12.5%
0.4854 18
12.5%
1.375 18
12.5%
0.9896 18
12.5%
0.5125 7
 
4.9%
0.8563 7
 
4.9%
2.3333 7
 
4.9%
0.1229 5
 
3.5%
Other values (2) 10
6.9%
ValueCountFrequency (%)
0.1229 5
 
3.5%
0.1708 5
 
3.5%
0.2792 18
12.5%
0.2854 18
12.5%
0.3563 5
 
3.5%
0.4854 18
12.5%
0.4896 18
12.5%
0.5125 7
 
4.9%
0.8563 7
 
4.9%
0.9896 18
12.5%
ValueCountFrequency (%)
2.3333 7
 
4.9%
1.375 18
12.5%
0.9896 18
12.5%
0.8563 7
 
4.9%
0.5125 7
 
4.9%
0.4896 18
12.5%
0.4854 18
12.5%
0.3563 5
 
3.5%
0.2854 18
12.5%
0.2792 18
12.5%

Interactions

2024-03-23T06:50:09.682543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:05.498778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:06.828368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:08.491041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:09.954710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:05.908009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:07.442996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:08.773295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:10.280139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:06.244750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:07.899583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:09.073087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:10.515756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:06.562898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:08.198933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:50:09.371063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T06:50:19.892811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시작용량끝용량시설용량보정계수검사방법아이디시설타입아이디시설공량
시작용량1.0000.8730.9640.0000.3630.000
끝용량0.8731.0001.0000.0000.0000.000
시설용량보정계수0.9641.0001.0000.0000.0000.000
검사방법아이디0.0000.0000.0001.0000.0000.761
시설타입아이디0.3630.0000.0000.0001.0000.530
시설공량0.0000.0000.0000.7610.5301.000
2024-03-23T06:50:20.530530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설타입아이디검사방법아이디
시설타입아이디1.0000.000
검사방법아이디0.0001.000
2024-03-23T06:50:20.978455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시작용량끝용량시설용량보정계수시설공량검사방법아이디시설타입아이디
시작용량1.0000.9620.944-0.0520.0000.239
끝용량0.9621.0000.943-0.0780.0000.000
시설용량보정계수0.9440.9431.000-0.0990.0000.000
시설공량-0.052-0.078-0.0991.0000.7610.393
검사방법아이디0.0000.0000.0000.7611.0000.000
시설타입아이디0.2390.0000.0000.3930.0001.000

Missing values

2024-03-23T06:50:10.931777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:50:11.738595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시작용량끝용량시설용량보정계수검사방법아이디시설타입아이디시설공량
0050.6유해법제조사용시설0.5125
15500.8유해법제조사용시설0.5125
2505001.0유해법제조사용시설0.5125
350050001.2유해법제조사용시설0.5125
45000500001.4유해법제조사용시설0.5125
5500005000001.6유해법제조사용시설0.5125
650000099999999991.8유해법제조사용시설0.5125
7050.6유해법실내저장시설0.2792
85100.8유해법실내저장시설0.2792
910201.0유해법실내저장시설0.2792
시작용량끝용량시설용량보정계수검사방법아이디시설타입아이디시설공량
1341000100001.2화관법설치실외보관시설0.9896
135100001000001.4화관법설치실외보관시설0.9896
13610000010000001.6화관법설치실외보관시설0.9896
137100000024000002.2화관법설치실외보관시설0.9896
138240000099999999992.8화관법설치실외보관시설0.9896
139010001.0화관법설치지하저장시설0.3563
1401000100001.2화관법설치지하저장시설0.3563
14110000500001.6화관법설치지하저장시설0.3563
142500001200002.2화관법설치지하저장시설0.3563
14312000099999999992.8화관법설치지하저장시설0.3563