Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows453
Duplicate rows (%)4.5%
Total size in memory752.0 KiB
Average record size in memory77.0 B

Variable types

Categorical4
Text1
Numeric2
DateTime1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15962/S/1/datasetView.do

Alerts

기관 명 has constant value ""Constant
모델명 has constant value ""Constant
전원상태(꺼짐:0, 켜짐 :1) has constant value ""Constant
Dataset has 453 (4.5%) duplicate rowsDuplicates
사용전력(W) has 4929 (49.3%) zerosZeros

Reproduction

Analysis started2024-05-11 16:08:47.845547
Analysis finished2024-05-11 16:08:50.141215
Duration2.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관 명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
양천구
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양천구
2nd row양천구
3rd row양천구
4th row양천구
5th row양천구

Common Values

ValueCountFrequency (%)
양천구 10000
100.0%

Length

2024-05-12T01:08:50.329840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:08:50.615245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양천구 10000
100.0%

모델명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
340
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row340
2nd row340
3rd row340
4th row340
5th row340

Common Values

ValueCountFrequency (%)
340 10000
100.0%

Length

2024-05-12T01:08:50.913398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:08:51.202109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
340 10000
100.0%
Distinct73
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-12T01:08:52.028718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowec036093df
2nd row08ccb88d44
3rd row5adb0f7e18
4th rowe1ba1dbe08
5th rowe1ba1dbe08
ValueCountFrequency (%)
e1ba1dbe08 2061
 
20.6%
de28c29733 504
 
5.0%
ba77c59645 304
 
3.0%
08ccb88d44 273
 
2.7%
a8080bd747 244
 
2.4%
c11fce50b9 219
 
2.2%
7540803cd5 167
 
1.7%
6a309a539d 155
 
1.6%
7abcf2427a 147
 
1.5%
ea961e3698 142
 
1.4%
Other values (63) 5784
57.8%
2024-05-12T01:08:53.032209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 8620
 
8.6%
b 8613
 
8.6%
8 7965
 
8.0%
1 7821
 
7.8%
d 7687
 
7.7%
a 7537
 
7.5%
7 6487
 
6.5%
0 6413
 
6.4%
5 5753
 
5.8%
3 5410
 
5.4%
Other values (6) 27694
27.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 59032
59.0%
Lowercase Letter 40968
41.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 7965
13.5%
1 7821
13.2%
7 6487
11.0%
0 6413
10.9%
5 5753
9.7%
3 5410
9.2%
2 5374
9.1%
9 5330
9.0%
4 4295
7.3%
6 4184
7.1%
Lowercase Letter
ValueCountFrequency (%)
e 8620
21.0%
b 8613
21.0%
d 7687
18.8%
a 7537
18.4%
c 4838
11.8%
f 3673
9.0%

Most occurring scripts

ValueCountFrequency (%)
Common 59032
59.0%
Latin 40968
41.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 7965
13.5%
1 7821
13.2%
7 6487
11.0%
0 6413
10.9%
5 5753
9.7%
3 5410
9.2%
2 5374
9.1%
9 5330
9.0%
4 4295
7.3%
6 4184
7.1%
Latin
ValueCountFrequency (%)
e 8620
21.0%
b 8613
21.0%
d 7687
18.8%
a 7537
18.4%
c 4838
11.8%
f 3673
9.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 8620
 
8.6%
b 8613
 
8.6%
8 7965
 
8.0%
1 7821
 
7.8%
d 7687
 
7.7%
a 7537
 
7.5%
7 6487
 
6.5%
0 6413
 
6.4%
5 5753
 
5.8%
3 5410
 
5.4%
Other values (6) 27694
27.7%

가동유무
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
5259 
1
4741 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
0 5259
52.6%
1 4741
47.4%

Length

2024-05-12T01:08:53.258626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:08:53.417798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 5259
52.6%
1 4741
47.4%

사용전력(W)
Real number (ℝ)

ZEROS 

Distinct234
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.6211
Minimum0
Maximum1650
Zeros4929
Zeros (%)49.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-12T01:08:53.620492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q332
95-th percentile85
Maximum1650
Range1650
Interquartile range (IQR)32

Descriptive statistics

Standard deviation82.904759
Coefficient of variation (CV)3.001501
Kurtosis149.44786
Mean27.6211
Median Absolute Deviation (MAD)1
Skewness10.876087
Sum276211
Variance6873.1991
MonotonicityNot monotonic
2024-05-12T01:08:53.861613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4929
49.3%
6 449
 
4.5%
63 221
 
2.2%
30 196
 
2.0%
29 187
 
1.9%
32 181
 
1.8%
33 170
 
1.7%
13 166
 
1.7%
22 164
 
1.6%
64 159
 
1.6%
Other values (224) 3178
31.8%
ValueCountFrequency (%)
0 4929
49.3%
1 129
 
1.3%
2 17
 
0.2%
3 58
 
0.6%
4 32
 
0.3%
5 29
 
0.3%
6 449
 
4.5%
7 18
 
0.2%
8 10
 
0.1%
9 3
 
< 0.1%
ValueCountFrequency (%)
1650 1
 
< 0.1%
1434 1
 
< 0.1%
1422 1
 
< 0.1%
1410 1
 
< 0.1%
1254 1
 
< 0.1%
1248 2
< 0.1%
1246 3
< 0.1%
1240 1
 
< 0.1%
1238 1
 
< 0.1%
1228 2
< 0.1%

빛의밝기(%)
Real number (ℝ)

Distinct68
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.6025
Minimum20
Maximum99
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-12T01:08:54.273706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q120
median20
Q324
95-th percentile38
Maximum99
Range79
Interquartile range (IQR)4

Descriptive statistics

Standard deviation7.4048212
Coefficient of variation (CV)0.31373038
Kurtosis19.165915
Mean23.6025
Median Absolute Deviation (MAD)0
Skewness3.7316738
Sum236025
Variance54.831377
MonotonicityNot monotonic
2024-05-12T01:08:54.536531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 5286
52.9%
21 986
 
9.9%
22 631
 
6.3%
23 519
 
5.2%
24 435
 
4.3%
29 276
 
2.8%
30 227
 
2.3%
25 205
 
2.1%
28 195
 
1.9%
31 117
 
1.2%
Other values (58) 1123
 
11.2%
ValueCountFrequency (%)
20 5286
52.9%
21 986
 
9.9%
22 631
 
6.3%
23 519
 
5.2%
24 435
 
4.3%
25 205
 
2.1%
26 105
 
1.1%
27 104
 
1.0%
28 195
 
1.9%
29 276
 
2.8%
ValueCountFrequency (%)
99 3
< 0.1%
97 1
 
< 0.1%
96 1
 
< 0.1%
95 1
 
< 0.1%
84 2
< 0.1%
83 2
< 0.1%
81 1
 
< 0.1%
80 2
< 0.1%
79 1
 
< 0.1%
78 2
< 0.1%

전원상태(꺼짐:0, 켜짐 :1)
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2024-05-12T01:08:54.776482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:08:54.928452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%
Distinct9460
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-01-15 00:03:04
Maximum2024-01-21 23:58:52
2024-05-12T01:08:55.097592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:08:55.331163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-05-12T01:08:48.881424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:08:48.342428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:08:49.153634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-12T01:08:48.604267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:08:55.490159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시리얼가동유무사용전력(W)빛의밝기(%)
시리얼1.0000.6560.6410.665
가동유무0.6561.0000.1510.118
사용전력(W)0.6410.1511.0000.059
빛의밝기(%)0.6650.1180.0591.000
2024-05-12T01:08:55.645631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용전력(W)빛의밝기(%)가동유무
사용전력(W)1.0000.1020.151
빛의밝기(%)0.1021.0000.090
가동유무0.1510.0901.000

Missing values

2024-05-12T01:08:49.530383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:08:49.954845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관 명모델명시리얼가동유무사용전력(W)빛의밝기(%)전원상태(꺼짐:0, 켜짐 :1)등록일자
8563양천구340ec036093df003312024-01-16 15:17:34
10977양천구34008ccb88d441292012024-01-17 08:11:34
14802양천구3405adb0f7e18003212024-01-18 13:35:26
19025양천구340e1ba1dbe08002012024-01-19 19:14:30
2263양천구340e1ba1dbe081632112024-01-15 09:25:09
9218양천구3407abcf2427a13162012024-01-16 19:37:16
10244양천구340c11fce50b9002012024-01-17 02:58:46
21375양천구340e1ba1dbe08042012024-01-20 11:16:25
1742양천구340e1ba1dbe08062012024-01-15 07:53:07
19892양천구340e1ba1dbe081642012024-01-20 01:03:31
기관 명모델명시리얼가동유무사용전력(W)빛의밝기(%)전원상태(꺼짐:0, 켜짐 :1)등록일자
9770양천구340e1ba1dbe08102012024-01-16 23:41:04
12874양천구340bdaa064145002012024-01-17 22:57:53
24981양천구3402de72cfce9002712024-01-21 13:07:42
26218양천구340d796bdf0f6012012024-01-21 22:26:13
3641양천구34085b819453c1472512024-01-15 13:53:58
15599양천구340e1ba1dbe081622112024-01-18 18:28:36
16643양천구340de28c29733002012024-01-19 01:53:36
9973양천구340c11fce50b9002012024-01-17 00:56:50
3657양천구3405d2ce46157003412024-01-15 13:56:55
524양천구340c8502fa322002012024-01-15 02:37:09

Duplicate rows

Most frequently occurring

기관 명모델명시리얼가동유무사용전력(W)빛의밝기(%)전원상태(꺼짐:0, 켜짐 :1)등록일자# duplicates
0양천구34005d372d9371252012024-01-15 02:56:432
1양천구34005d372d93711182012024-01-15 16:57:012
2양천구34005d372d93711192012024-01-15 01:48:042
3양천구34005d372d93711192012024-01-15 09:37:142
4양천구34005d372d93711192012024-01-15 16:57:122
5양천구34008ccb88d44002312024-01-15 16:53:582
6양천구34008ccb88d44002812024-01-15 15:53:592
7양천구34008ccb88d44003512024-01-15 14:54:002
8양천구34008ccb88d44004712024-01-15 11:54:002
9양천구34008ccb88d440305612024-01-15 12:06:292