Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells20000
Missing cells (%)28.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory625.3 KiB
Average record size in memory64.0 B

Variable types

Numeric2
Categorical2
DateTime1
Unsupported2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

모델명 has constant value ""Constant
고유번호 is highly imbalanced (56.7%)Imbalance
악취저감장치 연속OFF시간 has 10000 (100.0%) missing valuesMissing
등록일시 has 10000 (100.0%) missing valuesMissing
기관 명 is highly skewed (γ1 = 27.9970268)Skewed
악취저감장치 연속OFF시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등록일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has 8197 (82.0%) zerosZeros

Reproduction

Analysis started2024-05-11 16:41:16.895856
Analysis finished2024-05-11 16:41:18.884014
Duration1.99 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Real number (ℝ)

SKEWED 

Distinct900
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean752.0757
Minimum0
Maximum101475
Zeros10
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:41:19.097399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile51
Q1271
median553
Q3888
95-th percentile2065
Maximum101475
Range101475
Interquartile range (IQR)617

Descriptive statistics

Standard deviation3525.3617
Coefficient of variation (CV)4.6875091
Kurtosis797.17358
Mean752.0757
Median Absolute Deviation (MAD)303
Skewness27.997027
Sum7520757
Variance12428175
MonotonicityNot monotonic
2024-05-12T01:41:19.528717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23 26
 
0.3%
275 23
 
0.2%
69 22
 
0.2%
957 22
 
0.2%
27 21
 
0.2%
873 21
 
0.2%
276 20
 
0.2%
659 20
 
0.2%
47 19
 
0.2%
948 19
 
0.2%
Other values (890) 9787
97.9%
ValueCountFrequency (%)
0 10
0.1%
1 11
0.1%
2 16
0.2%
4 12
0.1%
5 10
0.1%
6 13
0.1%
7 8
0.1%
8 3
 
< 0.1%
9 7
0.1%
10 11
0.1%
ValueCountFrequency (%)
101475 12
0.1%
2117 11
0.1%
2116 9
0.1%
2114 14
0.1%
2113 9
0.1%
2112 15
0.1%
2111 8
0.1%
2110 12
0.1%
2108 12
0.1%
2107 15
0.1%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:41:19.927049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:41:20.210992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

고유번호
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
1
9110 
0
 
890

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1 9110
91.1%
0 890
 
8.9%

Length

2024-05-12T01:41:20.511863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:41:20.808300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9110
91.1%
0 890
 
8.9%
Distinct1167
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean620.7544
Minimum0
Maximum23721
Zeros8197
Zeros (%)82.0%
Negative0
Negative (%)0.0%
Memory size107.7 KiB
2024-05-12T01:41:21.135965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3429.6
Maximum23721
Range23721
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2284.9898
Coefficient of variation (CV)3.6809885
Kurtosis33.153066
Mean620.7544
Median Absolute Deviation (MAD)0
Skewness5.2344957
Sum6207544
Variance5221178.3
MonotonicityNot monotonic
2024-05-12T01:41:21.548529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8197
82.0%
11 14
 
0.1%
52 12
 
0.1%
60 11
 
0.1%
13 11
 
0.1%
45 10
 
0.1%
16 10
 
0.1%
2 9
 
0.1%
33 9
 
0.1%
9 9
 
0.1%
Other values (1157) 1708
 
17.1%
ValueCountFrequency (%)
0 8197
82.0%
2 9
 
0.1%
3 4
 
< 0.1%
4 7
 
0.1%
5 5
 
0.1%
6 9
 
0.1%
7 6
 
0.1%
8 5
 
0.1%
9 9
 
0.1%
10 7
 
0.1%
ValueCountFrequency (%)
23721 1
< 0.1%
23716 2
< 0.1%
23705 1
< 0.1%
23702 1
< 0.1%
23701 1
< 0.1%
23697 1
< 0.1%
23692 1
< 0.1%
23688 1
< 0.1%
23681 1
< 0.1%
23676 1
< 0.1%
Distinct9880
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size97.9 KiB
Minimum2024-03-25 00:22:30
Maximum2024-03-28 14:38:29
2024-05-12T01:41:21.872971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:22.112957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

악취저감장치 연속OFF시간
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

등록일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size107.7 KiB

Interactions

2024-05-12T01:41:17.717151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:17.363698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:17.973452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:41:17.565259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:41:22.275146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관 명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
기관 명1.0000.0000.000
고유번호0.0001.0000.433
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)0.0000.4331.000

Missing values

2024-05-12T01:41:18.309436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:41:18.709589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1002321102024-03-28 06:04:16<NA><NA>
NTS1008681102024-03-28 05:13:38<NA><NA>
NTS1003211102024-03-26 08:43:54<NA><NA>
NTS10057210432024-03-25 17:50:18<NA><NA>
NTS1008861102024-03-28 14:01:06<NA><NA>
NTS1004601125772024-03-25 02:01:18<NA><NA>
NTS1007661102024-03-25 04:26:30<NA><NA>
NTS1001561102024-03-26 02:20:39<NA><NA>
NTS10039710282024-03-27 03:25:43<NA><NA>
NTS1005581102024-03-26 12:33:52<NA><NA>
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1002151102024-03-27 00:12:36<NA><NA>
NTS1009321102024-03-28 04:27:52<NA><NA>
NTS100481102024-03-26 08:33:53<NA><NA>
NTS1009621102024-03-26 11:53:30<NA><NA>
NTS1002911102024-03-26 17:59:18<NA><NA>
NTS1008421125802024-03-25 04:27:44<NA><NA>
NTS1008461102024-03-27 18:17:51<NA><NA>
NTS1006031102024-03-28 02:02:18<NA><NA>
NTS1003121102024-03-28 02:01:48<NA><NA>
NTS1004001102024-03-28 00:29:54<NA><NA>