Overview

Dataset statistics

Number of variables: 8
Number of observations: 279
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.1 KiB
Average record size in memory: 66.5 B

Variable types

Numeric: 2
Categorical: 3
DateTime: 2
Boolean: 1

Dataset

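Description: Odor test records from unmanned odor samplers in Bupyeong-gu, Incheon Metropolitan City (site name, sampling date, measured value, etc.). Example record: 1,하나아파트 갈산동,2014-01-28,2014-01-28,악취,3,Y
Author: 인천광역시 부평구 (Bupyeong-gu, Incheon Metropolitan City)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15051645&srcSe=7661IVAWM27C61E190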

Alerts

Constant: 검사항목 has constant value "악취"
High correlation: 사업장명 is highly overall correlated with 소재지
High correlation: 소재지 is highly overall correlated with 사업장명
High correlation: 일련번호 is highly overall correlated with 측정값(배)
High correlation: 측정값(배) is highly overall correlated with 일련번호 and 1 other field
High correlation: 검사결과 is highly overall correlated with 측정값(배)
Imbalance: 검사결과 is highly imbalanced (87.0%)
Unique: 일련번호 has unique values

Reproduction

Analysis started: 2024-01-28 06:24:53.902288
Analysis finished: 2024-01-28 06:24:54.502138
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일련번호 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 279
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 140
Minimum: 1
Maximum: 279
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 14.9
Q1: 70.5
Median: 140
Q3: 209.5
95-th percentile: 265.1
Maximum: 279
Range: 278
Interquartile range (IQR): 139

Descriptive statistics

Standard deviation: 80.684571
Coefficient of variation (CV): 0.57631836
Kurtosis: -1.2
Mean: 140
Median Absolute Deviation (MAD): 70
Skewness: 0
Sum: 39060
Variance: 6510
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
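| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | 0.4% |
| 185 | 1 | 0.4% |
| 191 | 1 | 0.4% |
| 190 | 1 | 0.4% |
| 189 | 1 | 0.4% |
| 188 | 1 | 0.4% |
| 187 | 1 | 0.4% |
| 186 | 1 | 0.4% |
| 184 | 1 | 0.4% |
| 193 | 1 | 0.4% |
| Other values (269) | 269 | 96.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | 0.4% |
| 2 | 1 | 0.4% |
| 3 | 1 | 0.4% |
| 4 | 1 | 0.4% |
| 5 | 1 | 0.4% |
| 6 | 1 | 0.4% |
| 7 | 1 | 0.4% |
| 8 | 1 | 0.4% |
| 9 | 1 | 0.4% |
| 10 | 1 | 0.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 279 | 1 | 0.4% |
| 278 | 1 | 0.4% |
| 277 | 1 | 0.4% |
| 276 | 1 | 0.4% |
| 275 | 1 | 0.4% |
| 274 | 1 | 0.4% |
| 273 | 1 | 0.4% |
| 272 | 1 | 0.4% |
| 271 | 1 | 0.4% |
| 270 | 1 | 0.4% |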
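
Because 일련번호 is reported as unique and strictly increasing between 1 and 279, its summary statistics can be checked without the source file. The sketch below rebuilds the column with Python's standard library and recomputes the reported figures. It assumes the values are exactly the integers 1 to 279 (consistent with the reported minimum, maximum, and distinct count), that the variance is the sample variance, and that quantiles use linear interpolation between order statistics.

```python
import statistics

# Assumption: the column is exactly 1, 2, ..., 279.
serial = list(range(1, 280))

total = sum(serial)                      # report: Sum 39060
mean = statistics.mean(serial)           # report: Mean 140
variance = statistics.variance(serial)   # sample variance (n - 1); report: 6510
std_dev = statistics.stdev(serial)       # report: 80.684571
cv = std_dev / mean                      # report: 0.57631836

# method="inclusive" interpolates linearly between the min and max.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")
twentieths = statistics.quantiles(serial, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]  # report: 14.9 and 265.1

# Median absolute deviation around the median; report: 70
mad = statistics.median([abs(x - median) for x in serial])

print(total, mean, variance, round(std_dev, 6), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

The agreement suggests the column is a plain row counter, which is also why it carries no analytical information beyond row order.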

사업장명 (site name)
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
한국지엠㈜: 65
하나아파트: 64
서부사료㈜: 53
청천초등학교: 32
㈜지훈산업: 18
Other values (13): 47

Length

Max length: 16
Median length: 5
Mean length: 5.4193548
Min length: 4

Unique

Unique: 6
Unique (%): 2.2%

Sample

1st row: 하나아파트
2nd row: 청천초교
3rd row: 하나아파트
4th row: 하나아파트
5th row: 청천초교

Common Values

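| Value | Count | Frequency (%) |
|---|---|---|
| 한국지엠㈜ | 65 | 23.3% |
| 하나아파트 | 64 | 22.9% |
| 서부사료㈜ | 53 | 19.0% |
| 청천초등학교 | 32 | 11.5% |
| ㈜지훈산업 | 18 | 6.5% |
| 청천초교 | 10 | 3.6% |
| 태화아파트 | 10 | 3.6% |
| 동서식품㈜ | 9 | 3.2% |
| 인그리디언코리아(유)부평공장 | 6 | 2.2% |
| 인천탁주제조제1공장 | 2 | 0.7% |
| Other values (8) | 10 | 3.6% |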

Length

Histogram of lengths of the category
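| Value | Count | Frequency (%) |
|---|---|---|
| 한국지엠㈜ | 65 | 23.2% |
| 하나아파트 | 64 | 22.9% |
| 서부사료㈜ | 53 | 18.9% |
| 청천초등학교 | 32 | 11.4% |
| ㈜지훈산업 | 18 | 6.4% |
| 청천초교 | 10 | 3.6% |
| 태화아파트 | 10 | 3.6% |
| 동서식품㈜ | 9 | 3.2% |
| 인그리디언코리아(유)부평공장 | 6 | 2.1% |
| sh테크 | 2 | 0.7% |
| Other values (9) | 11 | 3.9% |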

소재지 (location)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
갈산동: 148
청천동: 122
십정동: 9

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 갈산동
2nd row: 청천동
3rd row: 갈산동
4th row: 갈산동
5th row: 청천동

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 갈산동 | 148 | 53.0% |
| 청천동 | 122 | 43.7% |
| 십정동 | 9 | 3.2% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
|---|---|---|
| 갈산동 | 148 | 53.0% |
| 청천동 | 122 | 43.7% |
| 십정동 | 9 | 3.2% |

채취일자 (sampling date)
DateTime

Distinct: 190
Distinct (%): 68.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2014-01-28 00:00:00
Maximum: 2023-06-20 00:00:00
Histogram with fixed size bins (bins=50)

의뢰일자 (request date)
DateTime

Distinct: 182
Distinct (%): 65.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2014-01-28 00:00:00
Maximum: 2023-06-20 00:00:00
Histogram with fixed size bins (bins=50)

검사항목 (test item)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
악취: 279

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 악취
2nd row: 악취
3rd row: 악취
4th row: 악취
5th row: 악취

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 악취 | 279 | 100.0% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
|---|---|---|
| 악취 | 279 | 100.0% |

측정값(배) (measured value, as a multiple)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 15
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 115.24014
Minimum: 3
Maximum: 6694
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 3
5-th percentile: 3
Q1: 3
Median: 100
Q3: 100
95-th percentile: 300
Maximum: 6694
Range: 6691
Interquartile range (IQR): 97

Descriptive statistics

Standard deviation: 424.51427
Coefficient of variation (CV): 3.683736
Kurtosis: 209.70841
Mean: 115.24014
Median Absolute Deviation (MAD): 97
Skewness: 13.730523
Sum: 32152
Variance: 180212.36
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=15)
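| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 117 | 41.9% |
| 100 | 106 | 38.0% |
| 144 | 16 | 5.7% |
| 300 | 11 | 3.9% |
| 208 | 9 | 3.2% |
| 173 | 3 | 1.1% |
| 669 | 3 | 1.1% |
| 448 | 3 | 1.1% |
| 5 | 2 | 0.7% |
| 4 | 2 | 0.7% |
| Other values (5) | 7 | 2.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 117 | 41.9% |
| 4 | 2 | 0.7% |
| 5 | 2 | 0.7% |
| 6 | 1 | 0.4% |
| 13 | 1 | 0.4% |
| 100 | 106 | 38.0% |
| 120 | 2 | 0.7% |
| 144 | 16 | 5.7% |
| 173 | 3 | 1.1% |
| 208 | 9 | 3.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 6694 | 1 | 0.4% |
| 1442 | 2 | 0.7% |
| 669 | 3 | 1.1% |
| 448 | 3 | 1.1% |
| 300 | 11 | 3.9% |
| 208 | 9 | 3.2% |
| 173 | 3 | 1.1% |
| 144 | 16 | 5.7% |
| 120 | 2 | 0.7% |
| 100 | 106 | 38.0% |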
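
Taken together, the frequency tables above list all 15 distinct values of 측정값(배), so the summary statistics can be recomputed from the counts alone. The following is a minimal sketch under two assumptions: the transcribed counts are complete (they do sum to 279), and the reported variance is the sample variance with an n - 1 denominator.

```python
import math
import statistics

# Value -> count, transcribed from the frequency tables for 측정값(배).
counts = {
    3: 117, 4: 2, 5: 2, 6: 1, 13: 1, 100: 106, 120: 2, 144: 16,
    173: 3, 208: 9, 300: 11, 448: 3, 669: 3, 1442: 2, 6694: 1,
}

n = sum(counts.values())                                  # report: 279 observations
total = sum(value * count for value, count in counts.items())  # report: Sum 32152
mean = total / n                                          # report: 115.24014

# Sample variance (divides by n - 1), computed directly from grouped data.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std_dev = math.sqrt(variance)                             # report: 424.51427

# Expand to one entry per observation to take quantiles.
observations = sorted(v for v, c in counts.items() for _ in range(c))
q1, median, q3 = statistics.quantiles(observations, n=4, method="inclusive")

print(n, total, round(mean, 5), round(std_dev, 5))
print(q1, median, q3, q3 - q1)
```

The single 6694 reading accounts for roughly a fifth of the sum, which helps explain why the standard deviation is so much larger than the interquartile range of 97.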

검사결과 (test result)
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 411.0 B
True: 274
False: 5
| Value | Count | Frequency (%) |
|---|---|---|
| True | 274 | 98.2% |
| False | 5 | 1.8% |
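
The 87.0% imbalance figure in the Alerts section is consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below checks that reading against the counts above. The formula is an assumption inferred from the numbers in this report, and `imbalance_score` is a hypothetical helper, not a function taken from the profiling library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts (hypothetical helper)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no entropy to normalize
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# True / False counts for 검사결과, taken from the table above.
score = imbalance_score([274, 5])
print(f"{score:.1%}")
```

Under this definition a perfectly balanced column scores 0 and a column dominated by one class approaches 1, so 274 versus 5 lands close to the top of the scale.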

Interactions

[Figure: interaction plots]

Correlations

|  | 일련번호 | 사업장명 | 소재지 | 측정값(배) | 검사결과 |
|---|---|---|---|---|---|
| 일련번호 | 1.000 | 0.740 | 0.277 | 0.000 | 0.228 |
| 사업장명 | 0.740 | 1.000 | 1.000 | 0.585 | 0.473 |
| 소재지 | 0.277 | 1.000 | 1.000 | 0.000 | 0.031 |
| 측정값(배) | 0.000 | 0.585 | 0.000 | 1.000 | 0.508 |
| 검사결과 | 0.228 | 0.473 | 0.031 | 0.508 | 1.000 |

|  | 사업장명 | 소재지 | 검사결과 |
|---|---|---|---|
| 사업장명 | 1.000 | 0.965 | 0.363 |
| 소재지 | 0.965 | 1.000 | 0.051 |
| 검사결과 | 0.363 | 0.051 | 1.000 |

|  | 일련번호 | 측정값(배) | 사업장명 | 소재지 | 검사결과 |
|---|---|---|---|---|---|
| 일련번호 | 1.000 | 0.720 | 0.389 | 0.169 | 0.172 |
| 측정값(배) | 0.720 | 1.000 | 0.320 | 0.000 | 0.768 |
| 사업장명 | 0.389 | 0.320 | 1.000 | 0.965 | 0.363 |
| 소재지 | 0.169 | 0.000 | 0.965 | 1.000 | 0.051 |
| 검사결과 | 0.172 | 0.768 | 0.363 | 0.051 | 1.000 |

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

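|  | 일련번호 | 사업장명 | 소재지 | 채취일자 | 의뢰일자 | 검사항목 | 측정값(배) | 검사결과 |
|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 하나아파트 | 갈산동 | 2014-01-28 | 2014-01-28 | 악취 | 3 | Y |
| 1 | 2 | 청천초교 | 청천동 | 2014-02-25 | 2014-02-25 | 악취 | 3 | Y |
| 2 | 3 | 하나아파트 | 갈산동 | 2014-02-27 | 2014-02-28 | 악취 | 3 | Y |
| 3 | 4 | 하나아파트 | 갈산동 | 2014-03-28 | 2014-03-28 | 악취 | 3 | Y |
| 4 | 5 | 청천초교 | 청천동 | 2014-03-28 | 2014-03-28 | 악취 | 3 | Y |
| 5 | 6 | 하나아파트 | 갈산동 | 2014-04-24 | 2014-04-24 | 악취 | 3 | Y |
| 6 | 7 | 청천초교 | 청천동 | 2014-05-15 | 2014-05-15 | 악취 | 3 | Y |
| 7 | 8 | 하나아파트 | 갈산동 | 2014-05-15 | 2014-05-15 | 악취 | 3 | Y |
| 8 | 9 | 하나아파트 | 갈산동 | 2014-05-21 | 2014-05-22 | 악취 | 3 | Y |
| 9 | 10 | 하나아파트 | 갈산동 | 2014-06-18 | 2014-06-18 | 악취 | 3 | Y |

|  | 일련번호 | 사업장명 | 소재지 | 채취일자 | 의뢰일자 | 검사항목 | 측정값(배) | 검사결과 |
|---|---|---|---|---|---|---|---|---|
| 269 | 270 | 서부사료㈜ | 갈산동 | 2023-03-15 | 2023-03-16 | 악취 | 100 | Y |
| 270 | 271 | 동서식품㈜ | 청천동 | 2023-03-16 | 2023-03-16 | 악취 | 100 | Y |
| 271 | 272 | 한국지엠㈜ | 청천동 | 2023-03-16 | 2023-03-17 | 악취 | 208 | Y |
| 272 | 273 | 한국지엠㈜ | 청천동 | 2023-03-16 | 2023-03-17 | 악취 | 100 | Y |
| 273 | 274 | 한국지엠㈜ | 청천동 | 2023-03-16 | 2023-03-17 | 악취 | 100 | Y |
| 274 | 275 | 한국지엠㈜ | 청천동 | 2023-05-17 | 2023-05-18 | 악취 | 208 | Y |
| 275 | 276 | 한국지엠㈜ | 청천동 | 2023-05-17 | 2023-05-18 | 악취 | 100 | Y |
| 276 | 277 | 한국지엠㈜ | 청천동 | 2023-06-20 | 2023-06-20 | 악취 | 100 | Y |
| 277 | 278 | 한국지엠㈜ | 청천동 | 2023-06-20 | 2023-06-20 | 악취 | 120 | Y |
| 278 | 279 | 한국지엠㈜ | 청천동 | 2023-06-20 | 2023-06-20 | 악취 | 100 | Y |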