Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 20000
Missing cells (%): 28.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 625.3 KiB
Average record size in memory: 64.0 B
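
The missing-cell percentage follows from the counts in this block: 20000 missing cells out of 7 × 10000 cells in total. A minimal sketch of that arithmetic; the function name `missing_cell_pct` is hypothetical and not part of ydata-profiling:

```python
def missing_cell_pct(n_missing: int, n_vars: int, n_obs: int) -> float:
    """Share of missing cells as a percentage of all cells in the table."""
    total_cells = n_vars * n_obs
    return 100.0 * n_missing / total_cells


# Figures copied from the dataset statistics.
pct = missing_cell_pct(n_missing=20000, n_vars=7, n_obs=10000)
print(f"{pct:.1f}%")  # 28.6%
```

The 20000 missing cells match two variables that are missing in all 10000 rows.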

Variable types

Numeric: 2
Categorical: 2
DateTime: 1
Unsupported: 2

Dataset

Description: File download
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

모델명 (model name) has constant value "" [Constant]
고유번호 (unique number) is highly imbalanced (54.6%) [Imbalance]
악취저감장치 연속OFF시간 (odor reduction device continuous OFF time) has 10000 (100.0%) missing values [Missing]
등록일시 (registration date and time) has 10000 (100.0%) missing values [Missing]
기관 명 (institution name) is highly skewed (γ1 = 29.18544191) [Skewed]
악취저감장치 연속OFF시간 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
등록일시 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) (IoT device status; off: 0, on: 1, unknown: 2) has 8131 (81.3%) zeros [Zeros]
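
The 54.6% imbalance figure for 고유번호 is consistent with a score of one minus the normalized Shannon entropy of the class counts (9047 and 953 in this report). The sketch below assumes that definition; the function name is hypothetical and the code is not taken from the ydata-profiling source:

```python
import math


def imbalance_score(counts):
    """1 - (Shannon entropy / maximum entropy).

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: a single-class column is scored as 0 here
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts for 고유번호 as reported: value 1 -> 9047 rows, value 0 -> 953 rows.
score = imbalance_score([9047, 953])
print(f"{score:.1%}")  # 54.6%
```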

Reproduction

Analysis started: 2024-05-11 16:41:09.959797
Analysis finished: 2024-05-11 16:41:11.402146
Duration: 1.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
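
The reported duration can be checked against the two timestamps. A minimal sketch using only the Python standard library; the variable names are illustrative:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction block.
started = datetime.strptime("2024-05-11 16:41:09.959797", FMT)
finished = datetime.strptime("2024-05-11 16:41:11.402146", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.44 seconds
```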

Variables

기관 명 (institution name)
Real number (ℝ)

SKEWED

Distinct: 900
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 736.089
Minimum: 0
Maximum: 101475
Zeros: 7
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 107.7 KiB

Quantile statistics

Minimum: 0
5th percentile: 55
Q1: 268
Median: 542
Q3: 878
95th percentile: 2063.1
Maximum: 101475
Range: 101475
Interquartile range (IQR): 610

Descriptive statistics

Standard deviation: 3378.003
Coefficient of variation (CV): 4.589123
Kurtosis: 867.65496
Mean: 736.089
Median Absolute Deviation (MAD): 301
Skewness: 29.185442
Sum: 7360890
Variance: 11410904
Monotonicity: Not monotonic
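
Several of these figures are derived from others in the report, which allows a quick consistency check. A minimal sketch using only the numbers reported for 기관 명; the variable names are illustrative, and small differences are expected because the inputs are rounded:

```python
# Values copied from the quantile and descriptive statistics for 기관 명.
n_obs = 10000
minimum, maximum = 0, 101475
q1, q3 = 268, 878
mean, std = 736.089, 3378.003

value_range = maximum - minimum  # reported Range: 101475
iqr = q3 - q1                    # reported IQR: 610
cv = std / mean                  # reported CV: 4.589123
variance = std ** 2              # reported Variance: 11410904
total = mean * n_obs             # reported Sum: 7360890

print(value_range, iqr, round(cv, 4), round(variance), round(total))
```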
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
238 | 28 | 0.3%
873 | 24 | 0.2%
69 | 21 | 0.2%
236 | 20 | 0.2%
411 | 20 | 0.2%
1035 | 20 | 0.2%
260 | 20 | 0.2%
141 | 20 | 0.2%
328 | 20 | 0.2%
136 | 19 | 0.2%
Other values (890) | 9788 | 97.9%

Minimum 10 values

Value | Count | Frequency (%)
0 | 7 | 0.1%
1 | 13 | 0.1%
2 | 7 | 0.1%
4 | 10 | 0.1%
5 | 11 | 0.1%
6 | 11 | 0.1%
7 | 11 | 0.1%
8 | 12 | 0.1%
9 | 5 | 0.1%
10 | 11 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
101475 | 11 | 0.1%
2117 | 7 | 0.1%
2116 | 6 | 0.1%
2114 | 11 | 0.1%
2113 | 3 | < 0.1%
2112 | 14 | 0.1%
2111 | 15 | 0.1%
2110 | 14 | 0.1%
2108 | 5 | 0.1%
2107 | 11 | 0.1%
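
The "Other values (890)" row for 기관 명 can be recovered from the ten most frequent counts, the number of observations, and the number of distinct values. A minimal sketch of that bookkeeping; the variable names are illustrative:

```python
# Counts of the ten most frequent 기관 명 values, as listed in the report.
top_counts = [28, 24, 21, 20, 20, 20, 20, 20, 20, 19]
n_obs, n_distinct = 10000, 900

other_count = n_obs - sum(top_counts)          # rows outside the top ten
other_distinct = n_distinct - len(top_counts)  # distinct values outside the top ten
other_pct = 100.0 * other_count / n_obs

print(f"Other values ({other_distinct}) | {other_count} | {other_pct:.1f}%")
# Other values (890) | 9788 | 97.9%
```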

모델명 (model name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

고유번호 (unique number)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 9047 | 90.5%
0 | 953 | 9.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) (IoT device status; off: 0, on: 1, unknown: 2)
Real number (ℝ)

ZEROS

Distinct: 1187
Distinct (%): 11.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 587.4975
Minimum: 0
Maximum: 23554
Zeros: 8131
Zeros (%): 81.3%
Negative: 0
Negative (%): 0.0%
Memory size: 107.7 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 0
95th percentile: 3921.3
Maximum: 23554
Range: 23554
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2086.2698
Coefficient of variation (CV): 3.5511127
Kurtosis: 31.788242
Mean: 587.4975
Median Absolute Deviation (MAD): 0
Skewness: 5.0996931
Sum: 5874975
Variance: 4352521.9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 8131 | 81.3%
60 | 13 | 0.1%
33 | 13 | 0.1%
35 | 12 | 0.1%
49 | 11 | 0.1%
29 | 11 | 0.1%
92 | 11 | 0.1%
44 | 10 | 0.1%
36 | 10 | 0.1%
81 | 9 | 0.1%
Other values (1177) | 1769 | 17.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 8131 | 81.3%
2 | 3 | < 0.1%
3 | 5 | 0.1%
4 | 3 | < 0.1%
5 | 3 | < 0.1%
6 | 4 | < 0.1%
7 | 4 | < 0.1%
8 | 2 | < 0.1%
9 | 3 | < 0.1%
10 | 2 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
23554 | 1 | < 0.1%
23549 | 1 | < 0.1%
23547 | 1 | < 0.1%
23545 | 1 | < 0.1%
23540 | 1 | < 0.1%
23526 | 1 | < 0.1%
23503 | 1 | < 0.1%
23482 | 1 | < 0.1%
18252 | 1 | < 0.1%
18248 | 1 | < 0.1%

악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2) (odor reduction device status; off: 0, on: 1, unknown: 2)
DateTime

Distinct: 9881
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB
Minimum: 2024-03-18 00:09:24
Maximum: 2024-03-21 14:15:15

[Figure: Histogram with fixed size bins (bins=50)]

악취저감장치 연속OFF시간 (odor reduction device continuous OFF time)
Unsupported

MISSING, REJECTED, UNSUPPORTED

Missing: 10000
Missing (%): 100.0%
Memory size: 107.7 KiB

등록일시 (registration date and time)
Unsupported

MISSING, REJECTED, UNSUPPORTED

Missing: 10000
Missing (%): 100.0%
Memory size: 107.7 KiB

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 기관 명 | 고유번호 | IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
기관 명 | 1.000 | 0.000 | 0.000
고유번호 | 0.000 | 1.000 | 0.385
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) | 0.000 | 0.385 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

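(unnamed) | (unnamed) | 기관 명 | 모델명 | 고유번호 | IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) | 악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2) | 악취저감장치 연속OFF시간 | 등록일시
서울시 | NTS100 | 953 | 1 | 1 | 0 | 2024-03-21 00:22:28 | <NA> | <NA>
 | NTS100 | 256 | 1 | 1 | 0 | 2024-03-19 00:24:57 | <NA> | <NA>
 | NTS100 | 634 | 1 | 1 | 2373 | 2024-03-19 10:50:01 | <NA> | <NA>
 | NTS100 | 144 | 1 | 1 | 0 | 2024-03-18 22:49:32 | <NA> | <NA>
 | NTS100 | 636 | 1 | 1 | 0 | 2024-03-19 15:30:57 | <NA> | <NA>
 | NTS100 | 209 | 1 | 1 | 0 | 2024-03-20 00:47:46 | <NA> | <NA>
 | NTS100 | 543 | 1 | 1 | 0 | 2024-03-21 09:37:46 | <NA> | <NA>
 | NTS100 | 867 | 1 | 1 | 0 | 2024-03-18 01:54:31 | <NA> | <NA>
 | NTS100 | 141 | 1 | 1 | 0 | 2024-03-19 23:13:03 | <NA> | <NA>
 | NTS100 | 1048 | 1 | 1 | 0 | 2024-03-20 09:34:29 | <NA> | <NA>

(unnamed) | (unnamed) | 기관 명 | 모델명 | 고유번호 | IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) | 악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2) | 악취저감장치 연속OFF시간 | 등록일시
서울시 | NTS100 | 880 | 1 | 1 | 0 | 2024-03-20 16:33:04 | <NA> | <NA>
 | NTS100 | 207 | 1 | 1 | 0 | 2024-03-18 13:28:42 | <NA> | <NA>
 | NTS100 | 236 | 1 | 1 | 0 | 2024-03-18 11:55:33 | <NA> | <NA>
 | NTS100 | 229 | 1 | 1 | 0 | 2024-03-18 04:06:48 | <NA> | <NA>
 | NTS100 | 310 | 1 | 1 | 1778 | 2024-03-19 01:20:41 | <NA> | <NA>
 | NTS100 | 235 | 1 | 1 | 703 | 2024-03-20 07:02:45 | <NA> | <NA>
 | NTS100 | 53 | 1 | 1 | 0 | 2024-03-21 01:43:58 | <NA> | <NA>
 | NTS100 | 639 | 1 | 1 | 0 | 2024-03-21 10:26:48 | <NA> | <NA>
 | NTS100 | 587 | 1 | 1 | 0 | 2024-03-21 03:23:57 | <NA> | <NA>
 | NTS100 | 201 | 1 | 1 | 0 | 2024-03-20 01:34:29 | <NA> | <NA>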