Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells20000
Missing cells (%)28.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory625.3 KiB
Average record size in memory64.0 B

Variable types

Numeric2
Categorical2
DateTime1
Unsupported2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

모델명 has constant value ""Constant
고유번호 is highly imbalanced (58.0%)Imbalance
악취저감장치 연속OFF시간 has 10000 (100.0%) missing valuesMissing
등록일시 has 10000 (100.0%) missing valuesMissing
기관 명 is highly skewed (γ1 = 30.50583474)Skewed
악취저감장치 연속OFF시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등록일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has 8230 (82.3%) zerosZeros

Reproduction

Analysis started2024-05-11 16:40:51.370524
Analysis finished2024-05-11 16:40:52.803186
Duration1.43 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Real number (ℝ)

SKEWED 

Distinct900
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean730.526
Minimum0
Maximum101475
Zeros7
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:40:52.941361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile50.95
Q1273
median551.5
Q3876
95-th percentile2066
Maximum101475
Range101475
Interquartile range (IQR)603

Descriptive statistics

Standard deviation3224.6548
Coefficient of variation (CV)4.4141548
Kurtosis950.3594
Mean730.526
Median Absolute Deviation (MAD)299
Skewness30.505835
Sum7305260
Variance10398399
MonotonicityNot monotonic
2024-05-12T01:40:53.190690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
873 27
 
0.3%
543 25
 
0.2%
69 24
 
0.2%
23 24
 
0.2%
238 24
 
0.2%
815 23
 
0.2%
966 21
 
0.2%
92 21
 
0.2%
97 20
 
0.2%
276 20
 
0.2%
Other values (890) 9771
97.7%
ValueCountFrequency (%)
0 7
 
0.1%
1 14
0.1%
2 14
0.1%
4 18
0.2%
5 13
0.1%
6 10
0.1%
7 11
0.1%
8 10
0.1%
9 7
 
0.1%
10 10
0.1%
ValueCountFrequency (%)
101475 10
0.1%
2117 12
0.1%
2116 16
0.2%
2114 14
0.1%
2113 11
0.1%
2112 15
0.1%
2111 15
0.1%
2110 11
0.1%
2108 15
0.1%
2107 8
0.1%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:40:53.411036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:53.563920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

고유번호
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
9148 
0
 
852

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9148
91.5%
0 852
 
8.5%

Length

2024-05-12T01:40:53.724944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:53.883645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9148
91.5%
0 852
 
8.5%
Distinct1233
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean567.1271
Minimum0
Maximum23054
Zeros8230
Zeros (%)82.3%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:40:54.073829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3481.1
Maximum23054
Range23054
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2097.6984
Coefficient of variation (CV)3.6988154
Kurtosis36.107338
Mean567.1271
Median Absolute Deviation (MAD)0
Skewness5.4290095
Sum5671271
Variance4400338.7
MonotonicityNot monotonic
2024-05-12T01:40:54.311372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8230
82.3%
12 12
 
0.1%
6 10
 
0.1%
4 9
 
0.1%
13 8
 
0.1%
50 8
 
0.1%
21 8
 
0.1%
25 7
 
0.1%
57 7
 
0.1%
23 7
 
0.1%
Other values (1223) 1694
 
16.9%
ValueCountFrequency (%)
0 8230
82.3%
2 2
 
< 0.1%
3 6
 
0.1%
4 9
 
0.1%
5 6
 
0.1%
6 10
 
0.1%
7 3
 
< 0.1%
8 6
 
0.1%
9 4
 
< 0.1%
10 5
 
0.1%
ValueCountFrequency (%)
23054 1
< 0.1%
23050 1
< 0.1%
23039 1
< 0.1%
23038 1
< 0.1%
23028 1
< 0.1%
23024 1
< 0.1%
23020 1
< 0.1%
23019 1
< 0.1%
23017 1
< 0.1%
23006 1
< 0.1%
Distinct9863
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
Minimum2024-02-26 00:00:25
Maximum2024-02-29 14:53:05
2024-05-12T01:40:54.539675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:54.783311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

악취저감장치 연속OFF시간
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

등록일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

Interactions

2024-05-12T01:40:52.108954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:51.707620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:52.303041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:51.927862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:40:55.067419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관 명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
기관 명1.0000.0000.000
고유번호0.0001.0000.430
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)0.0000.4301.000

Missing values

2024-05-12T01:40:52.500013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:40:52.708794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1001901102024-02-27 00:11:03<NA><NA>
NTS1009971102024-02-28 04:27:40<NA><NA>
NTS10010411102024-02-27 07:24:03<NA><NA>
NTS1001971102024-02-26 23:24:23<NA><NA>
NTS1008211102024-02-26 02:28:50<NA><NA>
NTS1003431102024-02-26 12:37:43<NA><NA>
NTS1007141102024-02-29 00:07:05<NA><NA>
NTS1005231102024-02-28 01:59:39<NA><NA>
NTS1002601102024-02-28 23:14:57<NA><NA>
NTS1008541102024-02-26 21:59:13<NA><NA>
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1003911102024-02-28 15:27:25<NA><NA>
NTS100991102024-02-27 14:59:30<NA><NA>
NTS1003061102024-02-27 01:54:38<NA><NA>
NTS1009661102024-02-26 21:14:18<NA><NA>
NTS1006391102024-02-28 22:32:18<NA><NA>
NTS100281102024-02-28 20:04:10<NA><NA>
NTS1004471102024-02-28 01:11:35<NA><NA>
NTS1008081102024-02-28 00:30:24<NA><NA>
NTS10094710132024-02-27 12:03:25<NA><NA>
NTS1009501102024-02-29 02:31:32<NA><NA>