Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows78
Duplicate rows (%)0.8%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Categorical4
Unsupported1
Numeric1
DateTime1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

기관 명 has constant value ""Constant
모델명 has constant value ""Constant
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has constant value ""Constant
Dataset has 78 (0.8%) duplicate rowsDuplicates
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2) is highly imbalanced (52.5%)Imbalance
고유번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
악취저감장치 연속OFF시간 has 8118 (81.2%) zerosZeros

Reproduction

Analysis started2024-05-11 16:40:02.773450
Analysis finished2024-05-11 16:40:05.254147
Duration2.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시
2nd row서울시
3rd row서울시
4th row서울시
5th row서울시

Common Values

ValueCountFrequency (%)
서울시 10000
100.0%

Length

2024-05-12T01:40:05.450506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:05.752241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울시 10000
100.0%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
NTS100
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNTS100
2nd rowNTS100
3rd rowNTS100
4th rowNTS100
5th rowNTS100

Common Values

ValueCountFrequency (%)
NTS100 10000
100.0%

Length

2024-05-12T01:40:06.056736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:06.341534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
nts100 10000
100.0%

고유번호
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB
Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:40:06.644935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:06.930633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
8981 
0
1019 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 8981
89.8%
0 1019
 
10.2%

Length

2024-05-12T01:40:07.231478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:07.528775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 8981
89.8%
0 1019
 
10.2%

악취저감장치 연속OFF시간
Real number (ℝ)

ZEROS 

Distinct1071
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean429.2965
Minimum0
Maximum21511
Zeros8118
Zeros (%)81.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-12T01:40:07.872827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2455.9
Maximum21511
Range21511
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1850.9169
Coefficient of variation (CV)4.3115117
Kurtosis43.663671
Mean429.2965
Median Absolute Deviation (MAD)0
Skewness6.0456315
Sum4292965
Variance3425893.3
MonotonicityNot monotonic
2024-05-12T01:40:08.285876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8118
81.2%
405 10
 
0.1%
410 9
 
0.1%
439 9
 
0.1%
408 8
 
0.1%
444 8
 
0.1%
74 8
 
0.1%
438 8
 
0.1%
96 7
 
0.1%
422 7
 
0.1%
Other values (1061) 1808
 
18.1%
ValueCountFrequency (%)
0 8118
81.2%
2 5
 
0.1%
3 2
 
< 0.1%
4 3
 
< 0.1%
5 5
 
0.1%
6 6
 
0.1%
7 2
 
< 0.1%
8 1
 
< 0.1%
9 2
 
< 0.1%
10 3
 
< 0.1%
ValueCountFrequency (%)
21511 1
< 0.1%
21507 1
< 0.1%
21505 1
< 0.1%
21504 1
< 0.1%
21498 1
< 0.1%
21488 1
< 0.1%
21487 1
< 0.1%
21486 1
< 0.1%
21481 1
< 0.1%
21472 1
< 0.1%
Distinct9811
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-25 00:20:25
Maximum2023-12-27 09:02:13
2024-05-12T01:40:08.682221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:09.131690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-05-12T01:40:04.295367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:40:09.406197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)1.0000.418
악취저감장치 연속OFF시간0.4181.000
2024-05-12T01:40:09.641389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
악취저감장치 연속OFF시간악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)
악취저감장치 연속OFF시간1.0000.418
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)0.4181.000

Missing values

2024-05-12T01:40:04.644060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:40:05.055245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
59819서울시NTS10000000010871102023-12-27 03:32:28
50818서울시NTS10000000021031102023-12-26 19:45:34
16960서울시NTS10000000003641102023-12-25 14:47:48
57122서울시NTS10000000003081102023-12-27 01:12:20
42197서울시NTS10000000006491102023-12-26 12:35:52
15480서울시NTS10000000000701102023-12-25 13:50:19
50724서울시NTS10000000010661193562023-12-26 19:43:49
47335서울시NTS10000000001751102023-12-26 17:11:03
13313서울시NTS1000000000861103372023-12-25 11:40:05
7616서울시NTS10000000005621102023-12-25 06:54:18
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
36249서울시NTS10000000020521102023-12-26 07:15:39
27516서울시NTS10000000004181102023-12-26 00:03:49
44080서울시NTS10000000007371102023-12-26 14:10:45
56149서울시NTS10000000010391102023-12-27 00:24:17
52449서울시NTS10000000009621102023-12-26 21:15:47
31928서울시NTS10000000002071102023-12-26 03:55:33
45844서울시NTS10000000006651102023-12-26 15:43:26
1681서울시NTS10000000003861022023-12-25 01:21:30
59408서울시NTS10000000006101102023-12-27 03:24:50
34058서울시NTS10000000006871102023-12-26 05:35:02

Duplicate rows

Most frequently occurring

기관 명모델명IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시# duplicates
52서울시NTS1001102023-12-27 02:33:126
8서울시NTS1001102023-12-25 02:07:515
57서울시NTS1001102023-12-27 02:35:315
74서울시NTS1001102023-12-27 02:45:305
13서울시NTS1001102023-12-25 02:09:004
44서울시NTS1001102023-12-26 02:32:514
49서울시NTS1001102023-12-27 02:32:264
56서울시NTS1001102023-12-27 02:35:114
65서울시NTS1001102023-12-27 02:41:184
66서울시NTS1001102023-12-27 02:41:584