Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 503
Duplicate rows (%): 5.0%
Total size in memory: 752.0 KiB
Average record size in memory: 77.0 B
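
The percentage and the average record size above follow from the raw counts. A minimal Python sketch of that arithmetic, using only figures listed in this block (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" block.
n_rows = 10_000
duplicate_rows = 503
total_size_kib = 752.0

# Share of duplicate rows, rounded to one decimal place as in the report.
duplicate_pct = round(100 * duplicate_rows / n_rows, 1)

# Average record size in bytes: total bytes divided by the row count.
avg_record_bytes = round(total_size_kib * 1024 / n_rows, 1)

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Average record size in memory: {avg_record_bytes} B")
```

Both values agree with the reported 5.0% and 77.0 B to the displayed precision.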

Variable types

Categorical: 4
Text: 1
Numeric: 2
DateTime: 1

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15962/S/1/datasetView.do

Alerts

기관 명 has constant value "양천구" (Constant)
모델명 has constant value "340" (Constant)
전원상태(꺼짐:0, 켜짐 :1) has constant value "1" (Constant)
Dataset has 503 (5.0%) duplicate rows (Duplicates)
사용전력(W) has 4720 (47.2%) zeros (Zeros)
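
The Constant and Zeros alerts can be reproduced with simple per-column checks. The sketch below is an assumption about the kind of logic involved, not ydata-profiling's actual implementation; `constant_columns` and `zero_share` are hypothetical helper names, and the five rows are a small excerpt of values that appear in this report's Sample section, so the resulting share is illustrative only.

```python
# Five rows taken from the report's sample; the full dataset has 10000 rows.
rows = [
    {"기관 명": "양천구", "모델명": "340", "시리얼": "6a309a539d",
     "사용전력(W)": 0, "전원상태(꺼짐:0, 켜짐 :1)": "1"},
    {"기관 명": "양천구", "모델명": "340", "시리얼": "c11fce50b9",
     "사용전력(W)": 0, "전원상태(꺼짐:0, 켜짐 :1)": "1"},
    {"기관 명": "양천구", "모델명": "340", "시리얼": "a8080bd747",
     "사용전력(W)": 32, "전원상태(꺼짐:0, 켜짐 :1)": "1"},
    {"기관 명": "양천구", "모델명": "340", "시리얼": "c559c3b49d",
     "사용전력(W)": 0, "전원상태(꺼짐:0, 켜짐 :1)": "1"},
    {"기관 명": "양천구", "모델명": "340", "시리얼": "ea961e3698",
     "사용전력(W)": 33, "전원상태(꺼짐:0, 켜짐 :1)": "1"},
]


def constant_columns(rows):
    """Return the columns whose value is identical in every row."""
    return [col for col in rows[0] if len({r[col] for r in rows}) == 1]


def zero_share(rows, col):
    """Return the fraction of rows in which `col` equals zero."""
    return sum(1 for r in rows if r[col] == 0) / len(rows)


print(constant_columns(rows))
print(zero_share(rows, "사용전력(W)"))
```

On the full dataset the same zero check would correspond to the reported 4720 of 10000 rows (47.2%).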

Reproduction

Analysis started: 2024-05-04 00:23:09.295182
Analysis finished: 2024-05-04 00:23:11.906208
Duration: 2.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기관 명 (institution name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
양천구 | 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양천구
2nd row: 양천구
3rd row: 양천구
4th row: 양천구
5th row: 양천구

Common Values

Value | Count | Frequency (%)
양천구 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

모델명 (model name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
340 | 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 340
2nd row: 340
3rd row: 340
4th row: 340
5th row: 340

Common Values

Value | Count | Frequency (%)
340 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

시리얼 (serial)
Text

Distinct: 74
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 100000
Distinct characters: 16
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 6a309a539d
2nd row: c11fce50b9
3rd row: a8080bd747
4th row: c559c3b49d
5th row: ea961e3698

Value | Count | Frequency (%)
e1ba1dbe08 | 1755 | 17.5%
de28c29733 | 534 | 5.3%
ba77c59645 | 301 | 3.0%
08ccb88d44 | 290 | 2.9%
a8080bd747 | 205 | 2.1%
c11fce50b9 | 196 | 2.0%
7abcf2427a | 191 | 1.9%
6a309a539d | 156 | 1.6%
7540803cd5 | 155 | 1.6%
ea961e3698 | 155 | 1.6%
Other values (64) | 6062 | 60.6%

Most occurring characters

Value | Count | Frequency (%)
e | 8166 | 8.2%
b | 8132 | 8.1%
8 | 7829 | 7.8%
d | 7706 | 7.7%
a | 7462 | 7.5%
1 | 7265 | 7.3%
7 | 6641 | 6.6%
0 | 6230 | 6.2%
5 | 6028 | 6.0%
3 | 5647 | 5.6%
Other values (6) | 28894 | 28.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 59572 | 59.6%
Lowercase Letter | 40428 | 40.4%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
8 | 7829 | 13.1%
1 | 7265 | 12.2%
7 | 6641 | 11.1%
0 | 6230 | 10.5%
5 | 6028 | 10.1%
3 | 5647 | 9.5%
2 | 5504 | 9.2%
9 | 5481 | 9.2%
4 | 4531 | 7.6%
6 | 4416 | 7.4%

Lowercase Letter

Value | Count | Frequency (%)
e | 8166 | 20.2%
b | 8132 | 20.1%
d | 7706 | 19.1%
a | 7462 | 18.5%
c | 4970 | 12.3%
f | 3992 | 9.9%

Most occurring scripts

Value | Count | Frequency (%)
Common | 59572 | 59.6%
Latin | 40428 | 40.4%

Most frequent character per script

Common

Value | Count | Frequency (%)
8 | 7829 | 13.1%
1 | 7265 | 12.2%
7 | 6641 | 11.1%
0 | 6230 | 10.5%
5 | 6028 | 10.1%
3 | 5647 | 9.5%
2 | 5504 | 9.2%
9 | 5481 | 9.2%
4 | 4531 | 7.6%
6 | 4416 | 7.4%

Latin

Value | Count | Frequency (%)
e | 8166 | 20.2%
b | 8132 | 20.1%
d | 7706 | 19.1%
a | 7462 | 18.5%
c | 4970 | 12.3%
f | 3992 | 9.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 100000 | 100.0%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
e | 8166 | 8.2%
b | 8132 | 8.1%
8 | 7829 | 7.8%
d | 7706 | 7.7%
a | 7462 | 7.5%
1 | 7265 | 7.3%
7 | 6641 | 6.6%
0 | 6230 | 6.2%
5 | 6028 | 6.0%
3 | 5647 | 5.6%
Other values (6) | 28894 | 28.9%

가동유무 (operating status)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
0 | 5244
1 | 4756

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 0
5th row: 1

Common Values

Value | Count | Frequency (%)
0 | 5244 | 52.4%
1 | 4756 | 47.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

사용전력(W) (power consumption, W)
Real number (ℝ)

ZEROS 

Distinct: 268
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.6653
Minimum: 0
Maximum: 2088
Zeros: 4720
Zeros (%): 47.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 31
95-th percentile: 91
Maximum: 2088
Range: 2088
Interquartile range (IQR): 31

Descriptive statistics

Standard deviation: 109.55333
Coefficient of variation (CV): 3.4597283
Kurtosis: 117.03903
Mean: 31.6653
Median Absolute Deviation (MAD): 2
Skewness: 9.9085143
Sum: 316653
Variance: 12001.933
Monotonicity: Not monotonic
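
Several of these figures are derived from others in the same block. A short Python sketch of the relationships, assuming the usual definitions (CV as standard deviation divided by mean, variance as the squared standard deviation, IQR as Q3 minus Q1); the inputs are the rounded values printed in the report, so the results match only to roughly the displayed precision:

```python
import math

# Values copied from the statistics listed for 사용전력(W).
mean = 31.6653
std = 109.55333
q1, q3 = 0, 31
minimum, maximum = 0, 2088

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV = {cv:.4f}, variance = {variance:.2f}, IQR = {iqr}, range = {value_range}")
```

A CV well above 1 together with a median of 2 and a maximum of 2088 is consistent with the heavy right skew reported for this variable.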

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 4720 | 47.2%
6 | 334 | 3.3%
1 | 267 | 2.7%
30 | 212 | 2.1%
29 | 187 | 1.9%
13 | 180 | 1.8%
63 | 178 | 1.8%
23 | 176 | 1.8%
15 | 168 | 1.7%
33 | 161 | 1.6%
Other values (258) | 3417 | 34.2%

Minimum 10 values

Value | Count | Frequency (%)
0 | 4720 | 47.2%
1 | 267 | 2.7%
2 | 28 | 0.3%
3 | 100 | 1.0%
4 | 22 | 0.2%
5 | 9 | 0.1%
6 | 334 | 3.3%
7 | 45 | 0.4%
8 | 69 | 0.7%
9 | 4 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
2088 | 1 | < 0.1%
1650 | 1 | < 0.1%
1649 | 4 | < 0.1%
1645 | 2 | < 0.1%
1644 | 1 | < 0.1%
1643 | 1 | < 0.1%
1642 | 2 | < 0.1%
1437 | 1 | < 0.1%
1425 | 1 | < 0.1%
1423 | 1 | < 0.1%

빛의밝기(%) (light brightness, %)
Real number (ℝ)

Distinct: 64
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23.6037
Minimum: 20
Maximum: 98
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20
5-th percentile: 20
Q1: 20
median: 20
Q3: 24
95-th percentile: 38
Maximum: 98
Range: 78
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 7.5953812
Coefficient of variation (CV): 0.32178774
Kurtosis: 19.293881
Mean: 23.6037
Median Absolute Deviation (MAD): 0
Skewness: 3.7597234
Sum: 236037
Variance: 57.689815
Monotonicity: Not monotonic
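
A MAD of 0 may look odd next to a standard deviation of about 7.6. It follows from the distribution: the median is 20 and more than half of the observations equal 20, so more than half of the absolute deviations from the median are 0. The Python sketch below illustrates this on a hypothetical 10-value sample chosen to mimic that shape; it is not the real data, and `mad` is an illustrative helper name.

```python
from statistics import median

# Hypothetical sample: 6 of 10 values sit at the median (20),
# loosely mirroring the majority share of 20s in 빛의밝기(%).
sample = [20, 20, 20, 20, 20, 20, 21, 23, 28, 38]


def mad(values):
    """Median of the absolute deviations from the median."""
    center = median(values)
    return median(abs(v - center) for v in values)


print(mad(sample))
```

Whenever a single value accounts for a majority of the observations, the MAD computed this way is 0 regardless of how far the remaining values spread.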

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
20 | 5526 | 55.3%
21 | 789 | 7.9%
23 | 599 | 6.0%
22 | 560 | 5.6%
24 | 412 | 4.1%
25 | 237 | 2.4%
28 | 228 | 2.3%
29 | 226 | 2.3%
30 | 151 | 1.5%
26 | 122 | 1.2%
Other values (54) | 1150 | 11.5%

Minimum 10 values

Value | Count | Frequency (%)
20 | 5526 | 55.3%
21 | 789 | 7.9%
22 | 560 | 5.6%
23 | 599 | 6.0%
24 | 412 | 4.1%
25 | 237 | 2.4%
26 | 122 | 1.2%
27 | 107 | 1.1%
28 | 228 | 2.3%
29 | 226 | 2.3%

Maximum 10 values

Value | Count | Frequency (%)
98 | 1 | < 0.1%
97 | 1 | < 0.1%
96 | 4 | < 0.1%
95 | 1 | < 0.1%
93 | 2 | < 0.1%
83 | 3 | < 0.1%
82 | 4 | < 0.1%
81 | 1 | < 0.1%
80 | 2 | < 0.1%
79 | 4 | < 0.1%

전원상태(꺼짐:0, 켜짐 :1) (power state; off: 0, on: 1)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
1 | 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

등록일자 (registration date)
DateTime

Distinct: 9392
Distinct (%): 93.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-01-08 00:01:12
Maximum: 2024-01-14 23:59:54

[Figure: Histogram with fixed size bins (bins=50)]

Interactions

[Figures: interaction plots (4 images)]

Correlations

[Figure: correlation heatmap]

Variable | 시리얼 | 가동유무 | 사용전력(W) | 빛의밝기(%)
시리얼 | 1.000 | 0.709 | 0.654 | 0.690
가동유무 | 0.709 | 1.000 | 0.165 | 0.102
사용전력(W) | 0.654 | 0.165 | 1.000 | 0.027
빛의밝기(%) | 0.690 | 0.102 | 0.027 | 1.000

[Figure: correlation heatmap]

Variable | 사용전력(W) | 빛의밝기(%) | 가동유무
사용전력(W) | 1.000 | 0.075 | 0.165
빛의밝기(%) | 0.075 | 1.000 | 0.078
가동유무 | 0.165 | 0.078 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

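Index | 기관 명 | 모델명 | 시리얼 | 가동유무 | 사용전력(W) | 빛의밝기(%) | 전원상태(꺼짐:0, 켜짐 :1) | 등록일자
22221 | 양천구 | 340 | 6a309a539d | 1 | 0 | 20 | 1 | 2024-01-13 23:22:11
4454 | 양천구 | 340 | c11fce50b9 | 1 | 0 | 23 | 1 | 2024-01-09 07:33:23
23726 | 양천구 | 340 | a8080bd747 | 1 | 32 | 37 | 1 | 2024-01-14 09:24:02
21286 | 양천구 | 340 | c559c3b49d | 0 | 0 | 20 | 1 | 2024-01-13 19:46:54
15311 | 양천구 | 340 | ea961e3698 | 1 | 33 | 33 | 1 | 2024-01-12 17:52:39
6384 | 양천구 | 340 | 11c90618b2 | 1 | 92 | 20 | 1 | 2024-01-09 21:34:36
19264 | 양천구 | 340 | f4350aa350 | 1 | 325 | 41 | 1 | 2024-01-13 12:58:49
9937 | 양천구 | 340 | 5d2ce46157 | 0 | 0 | 20 | 1 | 2024-01-10 23:57:41
2765 | 양천구 | 340 | bf813bad95 | 0 | 0 | 26 | 1 | 2024-01-08 19:15:55
12144 | 양천구 | 340 | 05d372d937 | 1 | 126 | 20 | 1 | 2024-01-11 18:48:26

Index | 기관 명 | 모델명 | 시리얼 | 가동유무 | 사용전력(W) | 빛의밝기(%) | 전원상태(꺼짐:0, 켜짐 :1) | 등록일자
13953 | 양천구 | 340 | ba77c59645 | 1 | 224 | 20 | 1 | 2024-01-12 07:35:09
22214 | 양천구 | 340 | f530a70d3a | 1 | 69 | 20 | 1 | 2024-01-13 23:20:44
7775 | 양천구 | 340 | e1ba1dbe08 | 0 | 11 | 20 | 1 | 2024-01-10 06:50:31
7447 | 양천구 | 340 | f4350aa350 | 0 | 0 | 20 | 1 | 2024-01-10 04:50:41
11599 | 양천구 | 340 | 85b819453c | 1 | 48 | 25 | 1 | 2024-01-11 14:42:01
22346 | 양천구 | 340 | a888aab6a9 | 0 | 0 | 20 | 1 | 2024-01-13 23:50:51
12252 | 양천구 | 340 | 08ccb88d44 | 1 | 33 | 23 | 1 | 2024-01-11 19:40:07
14662 | 양천구 | 340 | ea961e3698 | 0 | 0 | 34 | 1 | 2024-01-12 13:06:47
508 | 양천구 | 340 | 538e6d59f4 | 0 | 0 | 20 | 1 | 2024-01-08 03:54:39
8661 | 양천구 | 340 | 4c1da7fd41 | 0 | 0 | 25 | 1 | 2024-01-10 13:27:07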

Duplicate rows

Most frequently occurring

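Index | 기관 명 | 모델명 | 시리얼 | 가동유무 | 사용전력(W) | 빛의밝기(%) | 전원상태(꺼짐:0, 켜짐 :1) | 등록일자 | # duplicates
0 | 양천구 | 340 | 05d372d937 | 1 | 24 | 20 | 1 | 2024-01-13 05:58:09 | 2
1 | 양천구 | 340 | 05d372d937 | 1 | 25 | 20 | 1 | 2024-01-13 23:58:04 | 2
2 | 양천구 | 340 | 05d372d937 | 1 | 123 | 20 | 1 | 2024-01-13 07:57:02 | 2
3 | 양천구 | 340 | 05d372d937 | 1 | 126 | 20 | 1 | 2024-01-13 18:56:58 | 2
4 | 양천구 | 340 | 08ccb88d44 | 0 | 0 | 20 | 1 | 2024-01-13 00:50:23 | 2
5 | 양천구 | 340 | 08ccb88d44 | 0 | 0 | 26 | 1 | 2024-01-13 15:53:11 | 2
6 | 양천구 | 340 | 08ccb88d44 | 1 | 29 | 20 | 1 | 2024-01-13 07:31:54 | 2
7 | 양천구 | 340 | 08ccb88d44 | 1 | 29 | 22 | 1 | 2024-01-13 08:14:04 | 2
8 | 양천구 | 340 | 08ccb88d44 | 1 | 29 | 23 | 1 | 2024-01-13 08:26:52 | 2
9 | 양천구 | 340 | 08ccb88d44 | 1 | 29 | 23 | 1 | 2024-01-13 19:12:34 | 2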
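
A "# duplicates" count of this kind can be obtained by counting identical rows. The following Python sketch uses `collections.Counter` on a small hand-built list; it is only an illustration, not the report's own method. Each tuple holds (시리얼, 가동유무, 사용전력(W), 빛의밝기(%), 등록일자), two of the rows are entered twice to match their reported duplicate count of 2, and the variable names are hypothetical.

```python
from collections import Counter

# Rows as (시리얼, 가동유무, 사용전력(W), 빛의밝기(%), 등록일자).
rows = [
    ("05d372d937", "1", 24, 20, "2024-01-13 05:58:09"),
    ("05d372d937", "1", 24, 20, "2024-01-13 05:58:09"),
    ("08ccb88d44", "0", 0, 20, "2024-01-13 00:50:23"),
    ("08ccb88d44", "0", 0, 20, "2024-01-13 00:50:23"),
    ("08ccb88d44", "1", 33, 23, "2024-01-11 19:40:07"),
]

# Count how many times each distinct row occurs.
counts = Counter(rows)

# Keep only rows that occur more than once.
duplicates = {row: n for row, n in counts.items() if n > 1}

# Number of redundant copies, i.e. rows that deduplication would drop.
extra_copies = sum(n - 1 for n in duplicates.values())

for row, n in duplicates.items():
    print(row, n)
print("extra copies:", extra_copies)
```

Whether the report's figure of 503 counts every member of a duplicate group or only the redundant copies is not stated in this text, so both quantities are worth checking when reconciling numbers.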