Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows317
Duplicate rows (%)3.2%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Categorical4
Unsupported1
Numeric1
DateTime1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15966/S/1/datasetView.do

Alerts

기관 명 has constant value ""Constant
모델명 has constant value ""Constant
IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2) has constant value ""Constant
Dataset has 317 (3.2%) duplicate rowsDuplicates
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2) is highly imbalanced (56.1%)Imbalance
고유번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
악취저감장치 연속OFF시간 has 8339 (83.4%) zerosZeros

Reproduction

Analysis started2024-05-11 16:40:21.650102
Analysis finished2024-05-11 16:40:22.983300
Duration1.33 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시
2nd row서울시
3rd row서울시
4th row서울시
5th row서울시

Common Values

ValueCountFrequency (%)
서울시 10000
100.0%

Length

2024-05-12T01:40:23.174362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:23.460870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울시 10000
100.0%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
NTS100
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNTS100
2nd rowNTS100
3rd rowNTS100
4th rowNTS100
5th rowNTS100

Common Values

ValueCountFrequency (%)
NTS100 10000
100.0%

Length

2024-05-12T01:40:23.759913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:24.045673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
nts100 10000
100.0%

고유번호
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB
Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:40:24.344337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:24.632836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9091 
0
 
909

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
1 9091
90.9%
0 909
 
9.1%

Length

2024-05-12T01:40:24.931326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:40:25.224562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9091
90.9%
0 909
 
9.1%

악취저감장치 연속OFF시간
Real number (ℝ)

ZEROS 

Distinct1104
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean463.3125
Minimum0
Maximum22014
Zeros8339
Zeros (%)83.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-12T01:40:25.559803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2870.9
Maximum22014
Range22014
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1912.1964
Coefficient of variation (CV)4.1272282
Kurtosis42.590803
Mean463.3125
Median Absolute Deviation (MAD)0
Skewness5.9164054
Sum4633125
Variance3656495.1
MonotonicityNot monotonic
2024-05-12T01:40:25.970459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8339
83.4%
138 8
 
0.1%
128 7
 
0.1%
141 7
 
0.1%
144 7
 
0.1%
27 6
 
0.1%
905 6
 
0.1%
906 6
 
0.1%
947 6
 
0.1%
901 6
 
0.1%
Other values (1094) 1602
 
16.0%
ValueCountFrequency (%)
0 8339
83.4%
2 1
 
< 0.1%
3 5
 
0.1%
4 5
 
0.1%
5 2
 
< 0.1%
6 5
 
0.1%
7 2
 
< 0.1%
8 2
 
< 0.1%
9 2
 
< 0.1%
10 3
 
< 0.1%
ValueCountFrequency (%)
22014 2
< 0.1%
22011 1
< 0.1%
22004 1
< 0.1%
21998 1
< 0.1%
21994 1
< 0.1%
21984 1
< 0.1%
21981 1
< 0.1%
21976 1
< 0.1%
21975 1
< 0.1%
21974 1
< 0.1%
Distinct9639
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-01-15 00:08:35
Maximum2024-01-17 13:15:48
2024-05-12T01:40:26.369614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:40:26.776581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-05-12T01:40:21.958546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:40:27.018340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)1.0000.459
악취저감장치 연속OFF시간0.4591.000
2024-05-12T01:40:27.249457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
악취저감장치 연속OFF시간악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)
악취저감장치 연속OFF시간1.0000.460
악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)0.4601.000

Missing values

2024-05-12T01:40:22.292749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:40:22.803148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
43138서울시NTS10000000009461102024-01-15 18:17:47
51241서울시NTS10000000002591102024-01-15 22:02:45
94106서울시NTS1003551102024-01-17 08:44:33
13793서울시NTS100000000037510312024-01-15 05:46:12
94167서울시NTS100977109692024-01-17 08:45:40
62436서울시NTS10000000009841102024-01-16 05:13:35
9312서울시NTS10000000001521102024-01-15 04:04:42
20145서울시NTS10000000000991102024-01-15 08:45:01
61545서울시NTS10000000010031102024-01-16 04:27:03
13119서울시NTS10000000002521102024-01-15 05:39:58
기관 명모델명고유번호IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시
21906서울시NTS10000000000661102024-01-15 09:31:20
1276서울시NTS10000000003401102024-01-15 00:20:17
70939서울시NTS1002071102024-01-16 13:05:11
20295서울시NTS10000000001991102024-01-15 08:46:24
31854서울시NTS10000000007101102024-01-15 13:33:22
64083서울시NTS1000000000804107992024-01-16 06:58:13
24033서울시NTS10000000002611102024-01-15 10:21:00
14057서울시NTS10000000009211102024-01-15 05:48:38
73358서울시NTS10010291010482024-01-16 14:49:58
5237서울시NTS10000000010681102024-01-15 01:56:55

Duplicate rows

Most frequently occurring

기관 명모델명IoT기기상태값(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치상태(꺼짐:0, 켜짐:1, 알수없음:2)악취저감장치 연속OFF시간등록일시# duplicates
63서울시NTS1001102024-01-15 02:31:425
73서울시NTS1001102024-01-15 02:36:475
58서울시NTS1001102024-01-15 02:29:304
64서울시NTS1001102024-01-15 02:32:274
60서울시NTS1001102024-01-15 02:30:153
62서울시NTS1001102024-01-15 02:31:133
69서울시NTS1001102024-01-15 02:34:553
70서울시NTS1001102024-01-15 02:35:243
71서울시NTS1001102024-01-15 02:35:483
282서울시NTS1001102024-01-17 02:21:373