Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells20000
Missing cells (%)25.0%
Duplicate rows12
Duplicate rows (%)0.1%
Total size in memory742.2 KiB
Average record size in memory76.0 B

Variable types

Categorical4
DateTime1
Unsupported2
Numeric1

Alerts

설치장소명 has constant value ""Constant
디바이스명 has constant value ""Constant
보고형태 has constant value ""Constant
전기전도도측정값(m/S) has constant value ""Constant
Dataset has 12 (0.1%) duplicate rowsDuplicates
수온측정값(℃) has 10000 (100.0%) missing valuesMissing
배터리전압측정값(V) has 10000 (100.0%) missing valuesMissing
수온측정값(℃) is an unsupported type, check if it needs cleaning or further analysisUnsupported
배터리전압측정값(V) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 22:07:08.017320
Analysis finished2023-12-10 22:07:08.486078
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

설치장소명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
노래하는 분수
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노래하는 분수
2nd row노래하는 분수
3rd row노래하는 분수
4th row노래하는 분수
5th row노래하는 분수

Common Values

ValueCountFrequency (%)
노래하는 분수 10000
100.0%

Length

2023-12-11T07:07:08.544903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:07:08.634434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노래하는 10000
50.0%
분수 10000
50.0%

디바이스명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
waterquality
10000 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwaterquality
2nd rowwaterquality
3rd rowwaterquality
4th rowwaterquality
5th rowwaterquality

Common Values

ValueCountFrequency (%)
waterquality 10000
100.0%

Length

2023-12-11T07:07:08.714371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:07:08.836576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
waterquality 10000
100.0%
Distinct9988
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-07-21 09:08:22
Maximum2021-04-24 10:43:22
2023-12-11T07:07:08.958606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:07:09.319154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

보고형태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
report
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowreport
2nd rowreport
3rd rowreport
4th rowreport
5th rowreport

Common Values

ValueCountFrequency (%)
report 10000
100.0%

Length

2023-12-11T07:07:09.429796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:07:09.519603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
report 10000
100.0%

수온측정값(℃)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

전기전도도측정값(m/S)
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

2023-12-11T07:07:09.598426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:07:09.681144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 10000
100.0%

탁도측정값(NTU)
Real number (ℝ)

Distinct575
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.934754
Minimum0
Maximum1477.15
Zeros12
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:07:09.769257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile12.3
Q113.34
median13.87
Q314.61
95-th percentile16.35
Maximum1477.15
Range1477.15
Interquartile range (IQR)1.27

Descriptive statistics

Standard deviation97.912908
Coefficient of variation (CV)4.2691937
Kurtosis165.31312
Mean22.934754
Median Absolute Deviation (MAD)0.62
Skewness12.522564
Sum229347.54
Variance9586.9376
MonotonicityNot monotonic
2023-12-11T07:07:09.888975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14.01 123
 
1.2%
13.41 120
 
1.2%
13.83 116
 
1.2%
13.9 114
 
1.1%
13.43 112
 
1.1%
13.27 112
 
1.1%
13.73 111
 
1.1%
13.29 111
 
1.1%
13.55 110
 
1.1%
13.64 110
 
1.1%
Other values (565) 8861
88.6%
ValueCountFrequency (%)
0.0 12
0.1%
0.08 1
 
< 0.1%
0.36 1
 
< 0.1%
1.15 1
 
< 0.1%
1.31 1
 
< 0.1%
2.12 1
 
< 0.1%
2.59 1
 
< 0.1%
3.26 1
 
< 0.1%
3.35 1
 
< 0.1%
3.68 1
 
< 0.1%
ValueCountFrequency (%)
1477.15 16
0.2%
1473.18 1
 
< 0.1%
1471.91 1
 
< 0.1%
1459.28 1
 
< 0.1%
1455.31 1
 
< 0.1%
1454.62 1
 
< 0.1%
1448.24 1
 
< 0.1%
1412.95 1
 
< 0.1%
1393.78 1
 
< 0.1%
1392.11 1
 
< 0.1%

배터리전압측정값(V)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Interactions

2023-12-11T07:07:08.224026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T07:07:08.333868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:07:08.435975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

설치장소명디바이스명수집시간보고형태수온측정값(℃)전기전도도측정값(m/S)탁도측정값(NTU)배터리전압측정값(V)
40092노래하는 분수waterquality2020-10-25 21:08:22report<NA>011.63<NA>
5871노래하는 분수waterquality2021-04-03 23:28:22report<NA>014.01<NA>
15617노래하는 분수waterquality2021-01-28 18:38:21report<NA>014.78<NA>
47003노래하는 분수waterquality2020-10-01 21:08:22report<NA>014.94<NA>
28748노래하는 분수waterquality2020-12-05 03:03:22report<NA>013.32<NA>
12992노래하는 분수waterquality2021-03-11 05:48:22report<NA>013.32<NA>
17469노래하는 분수waterquality2021-01-14 12:28:22report<NA>013.85<NA>
43054노래하는 분수waterquality2020-10-15 14:18:22report<NA>053.23<NA>
28850노래하는 분수waterquality2020-12-04 18:33:22report<NA>013.36<NA>
33935노래하는 분수waterquality2020-11-17 02:18:22report<NA>014.22<NA>
설치장소명디바이스명수집시간보고형태수온측정값(℃)전기전도도측정값(m/S)탁도측정값(NTU)배터리전압측정값(V)
32508노래하는 분수waterquality2020-11-22 01:18:22report<NA>013.69<NA>
5910노래하는 분수waterquality2021-04-03 20:13:22report<NA>013.8<NA>
46993노래하는 분수waterquality2020-10-01 21:58:22report<NA>014.94<NA>
20179노래하는 분수waterquality2021-01-04 01:08:22report<NA>014.29<NA>
8560노래하는 분수waterquality2021-03-25 15:23:22report<NA>013.46<NA>
35976노래하는 분수waterquality2020-11-09 23:18:22report<NA>013.22<NA>
39411노래하는 분수waterquality2020-10-28 05:53:22report<NA>016.26<NA>
28685노래하는 분수waterquality2020-12-05 08:23:22report<NA>013.25<NA>
9163노래하는 분수waterquality2021-03-23 13:03:22report<NA>012.2<NA>
55357노래하는 분수waterquality2020-07-21 21:43:22report<NA>014.29<NA>

Duplicate rows

Most frequently occurring

설치장소명디바이스명수집시간보고형태전기전도도측정값(m/S)탁도측정값(NTU)# duplicates
0노래하는 분수waterquality2021-03-21 23:43:22report013.732
1노래하는 분수waterquality2021-03-22 03:33:22report014.22
2노래하는 분수waterquality2021-03-22 11:38:22report014.132
3노래하는 분수waterquality2021-03-22 11:53:22report013.922
4노래하는 분수waterquality2021-03-22 12:38:22report014.012
5노래하는 분수waterquality2021-03-22 15:53:22report013.972
6노래하는 분수waterquality2021-03-22 16:38:23report013.922
7노래하는 분수waterquality2021-03-22 17:18:22report013.92
8노래하는 분수waterquality2021-03-22 19:38:22report013.92
9노래하는 분수waterquality2021-03-22 20:08:22report014.112