Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 20000
Missing cells (%): 28.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 625.3 KiB
Average record size in memory: 64.0 B
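
The derived figures in this table follow from the raw counts. The sketch below re-enters the reported numbers and recomputes the two derived values; the variable names are illustrative and not part of the report, and treating 1 KiB as 1024 bytes is an assumption.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 10_000
missing_cells = 20_000
total_size_kib = 625.3

# Share of missing cells among all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 20000 missing cells correspond to the two columns that are empty in all 10000 rows.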

Variable types

Numeric: 2
Categorical: 2
DateTime: 1
Unsupported: 2

Dataset

Description: File download
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

모델명 has constant value "" [Constant]
고유번호 is highly imbalanced (59.5%) [Imbalance]
악취저감장치 연속OFF시간 has 10000 (100.0%) missing values [Missing]
등록일시 has 10000 (100.0%) missing values [Missing]
기관 명 is highly skewed (γ1 = 33.81433921) [Skewed]
악취저감장치 연속OFF시간 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
등록일시 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has 8384 (83.8%) zeros [Zeros]

Reproduction

Analysis started: 2024-05-17 22:15:46.793426
Analysis finished: 2024-05-17 22:15:49.837633
Duration: 3.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기관 명
Real number (ℝ)

SKEWED 

Distinct: 900
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 711.3988
Minimum: 0
Maximum: 101475
Zeros: 10
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 107.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 47
Q1: 273
Median: 555
Q3: 876
95-th percentile: 2066
Maximum: 101475
Range: 101475
Interquartile range (IQR): 603

Descriptive statistics

Standard deviation: 2893.0924
Coefficient of variation (CV): 4.066766
Kurtosis: 1175.0413
Mean: 711.3988
Median Absolute Deviation (MAD): 298
Skewness: 33.814339
Sum: 7113988
Variance: 8369983.7
Monotonicity: Not monotonic
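
Several of these statistics are tied together by simple identities: mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, and IQR = Q3 − Q1. The sketch below recomputes them from the reported inputs as a consistency check. The variable names are illustrative, and n = 10000 is taken from the number of observations, with no missing values for this variable. Because the inputs are already rounded, the results agree with the report only up to rounding in the last digit.

```python
# Inputs copied from the statistics reported for this variable.
n = 10_000            # observations (no missing values)
total = 7_113_988     # Sum
std = 2893.0924       # Standard deviation
q1, q3 = 273, 876     # Quartiles

mean = total / n      # Mean
cv = std / mean       # Coefficient of variation
variance = std ** 2   # Variance
iqr = q3 - q1         # Interquartile range

print(f"Mean: {mean:.4f}")
print(f"CV: {cv:.6f}")
print(f"Variance: {variance:.1f}")
print(f"IQR: {iqr}")
```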
Histogram with fixed size bins (bins=50)
Value               Count   Frequency (%)
23                  25      0.2%
40                  23      0.2%
69                  23      0.2%
36                  23      0.2%
238                 22      0.2%
812                 21      0.2%
629                 21      0.2%
620                 20      0.2%
838                 20      0.2%
276                 20      0.2%
Other values (890)  9782    97.8%

Value               Count   Frequency (%)
0                   10      0.1%
1                   17      0.2%
2                   9       0.1%
4                   10      0.1%
5                   10      0.1%
6                   11      0.1%
7                   9       0.1%
8                   7       0.1%
9                   12      0.1%
10                  13      0.1%

Value               Count   Frequency (%)
101475              8       0.1%
2117                13      0.1%
2116                9       0.1%
2114                5       0.1%
2113                15      0.1%
2112                7       0.1%
2111                9       0.1%
2110                11      0.1%
2108                10      0.1%
2107                12      0.1%

모델명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB

1    10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       10000   100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
1       10000   100.0%

고유번호
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB

1    9193
0    807

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       9193    91.9%
0       807     8.1%
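
The 59.5% imbalance reported for this variable in the Alerts section can be reproduced from these two counts. One formula that gives the same figure is one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy log2(k) for k classes. Treating this as the profiler's exact definition is an assumption; the sketch only shows that the numbers are consistent with it.

```python
import math

# Counts copied from the "Common Values" table for this variable.
counts = {"1": 9193, "0": 807}

total = sum(counts.values())
probs = [c / total for c in counts.values()]

# Shannon entropy in bits, then normalized by the maximum for k classes.
entropy = -sum(p * math.log2(p) for p in probs)
max_entropy = math.log2(len(counts))

# 0 means perfectly balanced classes, 1 means a single class.
imbalance = 1 - entropy / max_entropy

print(f"Imbalance: {imbalance:.1%}")
```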

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
1       9193    91.9%
0       807     8.1%

IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
Real number (ℝ)

ZEROS 

Distinct: 1076
Distinct (%): 10.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 570.7267
Minimum: 0
Maximum: 23871
Zeros: 8384
Zeros (%): 83.8%
Negative: 0
Negative (%): 0.0%
Memory size: 107.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 3397.05
Maximum: 23871
Range: 23871
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2156.3727
Coefficient of variation (CV): 3.7782929
Kurtosis: 28.659467
Mean: 570.7267
Median Absolute Deviation (MAD): 0
Skewness: 4.9952581
Sum: 5707267
Variance: 4649943
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count   Frequency (%)
0                    8384    83.8%
24                   14      0.1%
7                    13      0.1%
3                    13      0.1%
13                   12      0.1%
21                   12      0.1%
10                   11      0.1%
5                    11      0.1%
17                   10      0.1%
45                   10      0.1%
Other values (1066)  1510    15.1%

Value                Count   Frequency (%)
0                    8384    83.8%
2                    5       0.1%
3                    13      0.1%
4                    10      0.1%
5                    11      0.1%
6                    9       0.1%
7                    13      0.1%
8                    9       0.1%
9                    10      0.1%
10                   11      0.1%

Value                Count   Frequency (%)
23871                1       < 0.1%
23870                1       < 0.1%
23842                1       < 0.1%
23836                1       < 0.1%
23833                1       < 0.1%
18565                1       < 0.1%
18556                2       < 0.1%
18547                1       < 0.1%
18542                1       < 0.1%
18522                1       < 0.1%

악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)
DateTime

Distinct: 9832
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 97.9 KiB
Minimum: 2024-04-01 00:00:08
Maximum: 2024-04-04 14:05:42
Histogram with fixed size bins (bins=50)

악취저감장치 연속OFF시간
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 107.7 KiB

등록일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 107.7 KiB

Interactions


Correlations

Variable                                  기관 명   고유번호   IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)
기관 명                                   1.000     0.000      0.000
고유번호                                  0.000     1.000      0.482
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)   0.000     0.482      1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS1003061102024-04-02 23:35:20<NA><NA>
NTS1003241102024-04-01 03:05:54<NA><NA>
NTS1002081102024-04-03 04:06:31<NA><NA>
NTS1005151102024-04-04 08:27:31<NA><NA>
NTS1006681102024-04-03 20:35:45<NA><NA>
NTS1008571102024-04-01 20:15:09<NA><NA>
NTS10038010103972024-04-04 04:25:21<NA><NA>
NTS1001831102024-04-02 03:54:34<NA><NA>
NTS1002911102024-04-04 07:39:04<NA><NA>
NTS10020771102024-04-01 01:35:25<NA><NA>
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
서울시NTS10020841110192024-04-04 10:58:11<NA><NA>
NTS1001091102024-04-03 01:44:38<NA><NA>
NTS1006231102024-04-02 22:43:35<NA><NA>
NTS10020521102024-04-03 03:32:27<NA><NA>
NTS1001014751102024-04-02 12:43:58<NA><NA>
NTS100231102024-04-01 05:20:51<NA><NA>
NTS1001821102024-04-02 03:07:44<NA><NA>
NTS1004011113562024-04-02 11:45:39<NA><NA>
NTS1005711102024-04-04 03:35:28<NA><NA>
NTS10020931102024-04-04 11:45:08<NA><NA>