Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells20000
Missing cells (%)28.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory625.3 KiB
Average record size in memory64.0 B

Variable types

Numeric2
Categorical2
DateTime1
Unsupported2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

모델명 has constant value ""Constant
고유번호 is highly imbalanced (54.7%)Imbalance
악취저감장치 연속OFF시간 has 10000 (100.0%) missing valuesMissing
등록일시 has 10000 (100.0%) missing valuesMissing
기관 명 is highly skewed (γ1 = 29.13506385)Skewed
악취저감장치 연속OFF시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등록일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has 8127 (81.3%) zerosZeros

Reproduction

Analysis started2024-05-11 16:41:02.256188
Analysis finished2024-05-11 16:41:04.305663
Duration2.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Real number (ℝ)

SKEWED 

Distinct900
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean749.1421
Minimum0
Maximum101475
Zeros13
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:41:04.517291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile50
Q1276
median554
Q3889
95-th percentile2069
Maximum101475
Range101475
Interquartile range (IQR)613

Descriptive statistics

Standard deviation3379.5181
Coefficient of variation (CV)4.5111843
Kurtosis865.64444
Mean749.1421
Median Absolute Deviation (MAD)300
Skewness29.135064
Sum7491421
Variance11421142
MonotonicityNot monotonic
2024-05-12T01:41:04.949935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
238 30
 
0.3%
40 25
 
0.2%
276 24
 
0.2%
139 23
 
0.2%
977 22
 
0.2%
275 21
 
0.2%
23 21
 
0.2%
719 21
 
0.2%
861 20
 
0.2%
2105 20
 
0.2%
Other values (890) 9773
97.7%
ValueCountFrequency (%)
0 13
0.1%
1 10
0.1%
2 7
0.1%
4 8
0.1%
5 12
0.1%
6 8
0.1%
7 17
0.2%
8 9
0.1%
9 8
0.1%
10 11
0.1%
ValueCountFrequency (%)
101475 11
0.1%
2117 8
0.1%
2116 9
0.1%
2114 16
0.2%
2113 6
 
0.1%
2112 10
0.1%
2111 10
0.1%
2110 10
0.1%
2108 10
0.1%
2107 9
0.1%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:41:05.349998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:41:05.636475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

고유번호
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
9051 
0
949 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9051
90.5%
0 949
 
9.5%

Length

2024-05-12T01:41:06.062177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:41:06.358958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9051
90.5%
0 949
 
9.5%
Distinct1254
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean614.5877
Minimum0
Maximum23389
Zeros8127
Zeros (%)81.3%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:41:06.687610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile4304.4
Maximum23389
Range23389
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2156.8942
Coefficient of variation (CV)3.5094978
Kurtosis29.828466
Mean614.5877
Median Absolute Deviation (MAD)0
Skewness4.9580347
Sum6145877
Variance4652192.4
MonotonicityNot monotonic
2024-05-12T01:41:07.103126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8127
81.3%
12 15
 
0.1%
2 15
 
0.1%
4 14
 
0.1%
18 11
 
0.1%
44 11
 
0.1%
7 11
 
0.1%
25 11
 
0.1%
40 11
 
0.1%
13 10
 
0.1%
Other values (1244) 1764
 
17.6%
ValueCountFrequency (%)
0 8127
81.3%
2 15
 
0.1%
3 6
 
0.1%
4 14
 
0.1%
5 5
 
0.1%
6 9
 
0.1%
7 11
 
0.1%
8 10
 
0.1%
9 6
 
0.1%
10 10
 
0.1%
ValueCountFrequency (%)
23389 2
< 0.1%
23378 1
< 0.1%
23365 1
< 0.1%
23364 1
< 0.1%
23357 1
< 0.1%
23335 1
< 0.1%
23332 1
< 0.1%
23330 1
< 0.1%
23321 1
< 0.1%
18057 1
< 0.1%
Distinct9855
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
Minimum2024-03-11 00:00:09
Maximum2024-03-14 14:04:50
2024-05-12T01:41:07.493120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:07.924339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

악취저감장치 연속OFF시간
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

등록일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

Interactions

2024-05-12T01:41:03.129855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:02.623684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:03.384488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:02.876558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:41:08.185930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관 명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
기관 명1.0000.0000.000
고유번호0.0001.0000.438
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)0.0000.4381.000

Missing values

2024-05-12T01:41:03.722343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:41:04.129217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS100661102024-03-12 00:57:08<NA><NA>
NTS1004241102024-03-11 05:36:35<NA><NA>
NTS1001741102024-03-11 21:51:19<NA><NA>
NTS1001401122632024-03-11 22:37:36<NA><NA>
NTS1005171102024-03-12 15:52:06<NA><NA>
NTS10020561102024-03-13 17:12:19<NA><NA>
NTS100901102024-03-12 12:40:11<NA><NA>
NTS1005281102024-03-14 10:59:38<NA><NA>
NTS1003761102024-03-13 07:45:00<NA><NA>
NTS1009521102024-03-13 21:03:31<NA><NA>
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1003641102024-03-12 06:35:57<NA><NA>
NTS1008071102024-03-13 13:59:44<NA><NA>
NTS1003891102024-03-11 14:53:51<NA><NA>
NTS10011001102024-03-12 09:00:04<NA><NA>
NTS1005931122912024-03-13 03:35:51<NA><NA>
NTS10010291024232024-03-13 21:51:36<NA><NA>
NTS10010251102024-03-13 22:38:21<NA><NA>
NTS100910119742024-03-13 13:14:41<NA><NA>
NTS1008431102024-03-12 10:29:28<NA><NA>
NTS1002341102024-03-11 23:25:56<NA><NA>