Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 576.2 KiB
Average record size in memory: 59.0 B
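
As a quick consistency check, the average record size should equal the total memory size divided by the number of observations. The sketch below assumes that 1 KiB means 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 576.2
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 59.0 B shown in the overview.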

Variable types

DateTime: 1
Numeric: 3
Categorical: 2

Dataset

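Description: 부산광역시_열섬관측지점정보_20240229 (Busan Metropolitan City, heat island observation site information, 2024-02-29)
Author: 부산광역시 (Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15120945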

Alerts

High correlation: 센서항목코드 (sensor item code) is highly overall correlated with 센서값 (sensor value) and 1 other field
High correlation: 센서 코드 (sensor code) is highly overall correlated with 센서명 (sensor name)
High correlation: 센서값 is highly overall correlated with 센서항목코드 and 1 other field
High correlation: 센서항목 (sensor item) is highly overall correlated with 센서항목코드 and 1 other field
High correlation: 센서명 is highly overall correlated with 센서 코드

Reproduction

Analysis started: 2024-03-13 13:15:30.912158
Analysis finished: 2024-03-13 13:15:32.640094
Duration: 1.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
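
The reported duration can be checked from the two timestamps. This is a minimal sketch using only the Python standard library; it assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-13 13:15:30.912158")
finished = datetime.fromisoformat("2024-03-13 13:15:32.640094")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```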

Variables

날짜 및 시간 (date and time)
DateTime

Distinct: 1042
Distinct (%): 10.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-02-01 00:00:00
Maximum: 2024-02-04 14:45:00
[Figure: histogram with fixed size bins (bins=50)]

센서항목코드 (sensor item code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27374.31
Minimum: 8192
Maximum: 61440
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 8192
5-th percentile: 8192
Q1: 8448
Median: 28672
Q3: 29696
95-th percentile: 61440
Maximum: 61440
Range: 53248
Interquartile range (IQR): 21248
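
Because 센서항목코드 takes only six distinct values, the quantile statistics can be rebuilt from the value counts reported for this variable. The sketch below is a plain-Python illustration. It assumes quantiles are computed by linear interpolation between order statistics (the pandas default), which the report itself does not state, and the helper `quantile` is a hypothetical name.

```python
from statistics import median

# Value counts reported for 센서항목코드 (value -> count).
counts = {8192: 1692, 8448: 1675, 28672: 1674,
          28928: 1654, 29696: 1675, 61440: 1630}

# Expand the frequency table back into a sorted list of observations.
values = sorted(v for v, c in counts.items() for _ in range(c))

def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

q1, q3 = quantile(values, 0.25), quantile(values, 0.75)
iqr = q3 - q1
med = median(values)
# Median absolute deviation: median distance from the median.
mad = median(abs(v - med) for v in values)

print(q1, med, q3, iqr, mad)
```

Under these assumptions the sketch gives Q1 = 8448, median = 28672, Q3 = 29696, IQR = 21248 and MAD = 1024, matching the figures in this section.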

Descriptive statistics

Standard deviation: 17692.669
Coefficient of variation (CV): 0.64632381
Kurtosis: -0.30564335
Mean: 27374.31
Median Absolute Deviation (MAD): 1024
Skewness: 0.74358605
Sum: 2.737431 × 10^8
Variance: 3.1303053 × 10^8
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=6)]
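Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 8192 | 1692 | 16.9% |
| 8448 | 1675 | 16.8% |
| 29696 | 1675 | 16.8% |
| 28672 | 1674 | 16.7% |
| 28928 | 1654 | 16.5% |
| 61440 | 1630 | 16.3% |

Smallest values (ascending)

| Value | Count | Frequency (%) |
|---|---|---|
| 8192 | 1692 | 16.9% |
| 8448 | 1675 | 16.8% |
| 28672 | 1674 | 16.7% |
| 28928 | 1654 | 16.5% |
| 29696 | 1675 | 16.8% |
| 61440 | 1630 | 16.3% |

Largest values (descending)

| Value | Count | Frequency (%) |
|---|---|---|
| 61440 | 1630 | 16.3% |
| 29696 | 1675 | 16.8% |
| 28928 | 1654 | 16.5% |
| 28672 | 1674 | 16.7% |
| 8448 | 1675 | 16.8% |
| 8192 | 1692 | 16.9% |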
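
The mean, sum, variance, standard deviation and coefficient of variation of 센서항목코드 follow directly from the value counts above. The sketch below assumes the sample variance (divisor n - 1), as pandas uses by default; the report does not name its estimator, so treat this as an illustration rather than the tool's actual code path.

```python
from math import sqrt

# Value counts reported for 센서항목코드 (value -> count).
counts = {8192: 1692, 8448: 1675, 28672: 1674,
          28928: 1654, 29696: 1675, 61440: 1630}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # "Sum" in the report
mean = total / n

# Assumption: sample variance with divisor n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 2), round(std, 3), round(cv, 5))
```

With the n - 1 divisor the results agree with the reported mean (27374.31), standard deviation (17692.669) and CV (0.64632381) to the precision shown.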

센서항목 (sensor item)
Categorical

HIGH CORRELATION

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 온도 (temperature)
2nd row: 기타 (other)
3rd row: 풍속 (wind speed)
4th row: 풍향 (wind direction)
5th row: 습도 (humidity)

Common Values

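| Value | Count | Frequency (%) |
|---|---|---|
| 온도 (temperature) | 1692 | 16.9% |
| 습도 (humidity) | 1675 | 16.8% |
| 기압 (air pressure) | 1675 | 16.8% |
| 풍향 (wind direction) | 1674 | 16.7% |
| 풍속 (wind speed) | 1654 | 16.5% |
| 기타 (other) | 1630 | 16.3% |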

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

센서 코드 (sensor code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.4202
Minimum: 1
Maximum: 16
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 8
Q3: 12
95-th percentile: 16
Maximum: 16
Range: 15
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 4.5515606
Coefficient of variation (CV): 0.54055255
Kurtosis: -1.1922435
Mean: 8.4202
Median Absolute Deviation (MAD): 4
Skewness: 0.026390918
Sum: 84202
Variance: 20.716704
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=16)]
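
The skewness and kurtosis of 센서 코드 can be recomputed from its 16 value counts. The sketch below assumes the bias-corrected (adjusted Fisher-Pearson) estimators and excess kurtosis, the convention pandas follows; the report does not say which estimator it uses, and the helper `central_moment` is a hypothetical name.

```python
from math import sqrt

# Value counts reported for 센서 코드 (value -> count).
counts = {1: 584, 2: 641, 3: 607, 4: 685, 5: 645, 6: 658, 7: 618, 8: 657,
          9: 624, 10: 616, 11: 646, 12: 617, 13: 629, 14: 612, 15: 587, 16: 574}

n = sum(counts.values())
mean = sum(v * c for v, c in counts.items()) / n

def central_moment(k):
    """k-th central moment with divisor n (biased)."""
    return sum(c * (v - mean) ** k for v, c in counts.items()) / n

m2, m3, m4 = central_moment(2), central_moment(3), central_moment(4)
g1 = m3 / m2 ** 1.5          # biased skewness
g2 = m4 / m2 ** 2 - 3        # biased excess kurtosis

# Assumption: bias-corrected estimators, as in pandas.
skewness = sqrt(n * (n - 1)) / (n - 2) * g1
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

print(round(mean, 4), round(skewness, 4), round(kurtosis, 4))
```

A kurtosis close to -1.2 is what a discrete distribution that is nearly uniform over 16 codes would give, which fits the roughly equal counts per sensor.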
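Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 685 | 6.9% |
| 6 | 658 | 6.6% |
| 8 | 657 | 6.6% |
| 11 | 646 | 6.5% |
| 5 | 645 | 6.5% |
| 2 | 641 | 6.4% |
| 13 | 629 | 6.3% |
| 9 | 624 | 6.2% |
| 7 | 618 | 6.2% |
| 12 | 617 | 6.2% |
| Other values (6) | 3580 | 35.8% |

Smallest values (ascending)

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 584 | 5.8% |
| 2 | 641 | 6.4% |
| 3 | 607 | 6.1% |
| 4 | 685 | 6.9% |
| 5 | 645 | 6.5% |
| 6 | 658 | 6.6% |
| 7 | 618 | 6.2% |
| 8 | 657 | 6.6% |
| 9 | 624 | 6.2% |
| 10 | 616 | 6.2% |

Largest values (descending)

| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 574 | 5.7% |
| 15 | 587 | 5.9% |
| 14 | 612 | 6.1% |
| 13 | 629 | 6.3% |
| 12 | 617 | 6.2% |
| 11 | 646 | 6.5% |
| 10 | 616 | 6.2% |
| 9 | 624 | 6.2% |
| 8 | 657 | 6.6% |
| 7 | 618 | 6.2% |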

센서명 (sensor name)
Categorical

HIGH CORRELATION

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 4
Median length: 3
Mean length: 2.8194
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산진구
2nd row: 부산진구
3rd row: 영도구
4th row: 기장군
5th row: 연제구

Common Values

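| Value | Count | Frequency (%) |
|---|---|---|
| 영도구 | 685 | 6.9% |
| 남구 | 658 | 6.6% |
| 해운대구 | 657 | 6.6% |
| 강서구 | 646 | 6.5% |
| 부산진구 | 645 | 6.5% |
| 서구 | 641 | 6.4% |
| 수영구 | 629 | 6.3% |
| 사하구 | 624 | 6.2% |
| 북구 | 618 | 6.2% |
| 연제구 | 617 | 6.2% |
| Other values (6) | 3580 | 35.8% |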

Length

[Figure: histogram of lengths of the category]

센서값 (sensor value)
Real number (ℝ)

HIGH CORRELATION

Distinct: 9932
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3632427.3
Minimum: -99.957146
Maximum: 30767850
Zeros: 27
Zeros (%): 0.3%
Negative: 91
Negative (%): 0.9%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -99.957146
5-th percentile: 0.40424937
Q1: 6.9012158
Median: 89.238141
Q3: 1025.286
95-th percentile: 30507003
Maximum: 30767850
Range: 30767950
Interquartile range (IQR): 1018.3848

Descriptive statistics

Standard deviation: 9437238.8
Coefficient of variation (CV): 2.5980531
Kurtosis: 3.5459242
Mean: 3632427.3
Median Absolute Deviation (MAD): 88.948687
Skewness: 2.3246489
Sum: 3.6324273 × 10^10
Variance: 8.9061476 × 10^13
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]
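Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 27 | 0.3% |
| 100.0 | 15 | 0.1% |
| 359.973999 | 2 | < 0.1% |
| 1025.080005 | 2 | < 0.1% |
| 1022.464001 | 2 | < 0.1% |
| 354.881012 | 2 | < 0.1% |
| 1025.307983 | 2 | < 0.1% |
| 1022.1 | 2 | < 0.1% |
| 355.6589966 | 2 | < 0.1% |
| 341.2980042 | 2 | < 0.1% |
| Other values (9922) | 9942 | 99.4% |

Smallest values (ascending)

| Value | Count | Frequency (%) |
|---|---|---|
| -99.95714613 | 1 | < 0.1% |
| -99.95663423 | 1 | < 0.1% |
| -99.95625128 | 1 | < 0.1% |
| -99.95615642 | 1 | < 0.1% |
| -99.95599962 | 1 | < 0.1% |
| -99.95577232 | 1 | < 0.1% |
| -99.95545868 | 1 | < 0.1% |
| -99.95527903 | 1 | < 0.1% |
| -99.9546418 | 1 | < 0.1% |
| -99.95290679 | 1 | < 0.1% |

Largest values (descending)

| Value | Count | Frequency (%) |
|---|---|---|
| 30767850.0 | 1 | < 0.1% |
| 30766250.0 | 1 | < 0.1% |
| 30762450.0 | 1 | < 0.1% |
| 30761550.0 | 1 | < 0.1% |
| 30761250.0 | 1 | < 0.1% |
| 30760860.0 | 1 | < 0.1% |
| 30759450.0 | 1 | < 0.1% |
| 30759318.18 | 1 | < 0.1% |
| 30758850.0 | 1 | < 0.1% |
| 30758153.85 | 1 | < 0.1% |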

Interactions

[Figures: nine interaction plots]

Correlations

| | 센서항목코드 | 센서항목 | 센서 코드 | 센서명 | 센서값 |
|---|---|---|---|---|---|
| 센서항목코드 | 1.000 | 1.000 | 0.000 | 0.000 | 0.850 |
| 센서항목 | 1.000 | 1.000 | 0.000 | 0.000 | 0.678 |
| 센서 코드 | 0.000 | 0.000 | 1.000 | 1.000 | 0.381 |
| 센서명 | 0.000 | 0.000 | 1.000 | 1.000 | 0.642 |
| 센서값 | 0.850 | 0.678 | 0.381 | 0.642 | 1.000 |

| | 센서명 | 센서항목 |
|---|---|---|
| 센서명 | 1.000 | 0.000 |
| 센서항목 | 0.000 | 1.000 |

| | 센서항목코드 | 센서 코드 | 센서값 | 센서항목 | 센서명 |
|---|---|---|---|---|---|
| 센서항목코드 | 1.000 | -0.004 | 0.652 | 1.000 | 0.000 |
| 센서 코드 | -0.004 | 1.000 | -0.047 | 0.000 | 1.000 |
| 센서값 | 0.652 | -0.047 | 1.000 | 0.507 | 0.352 |
| 센서항목 | 1.000 | 0.000 | 0.507 | 1.000 | 0.000 |
| 센서명 | 0.000 | 1.000 | 0.352 | 0.000 | 1.000 |
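
The coefficients of 1.000 between 센서 코드 and 센서명 are what a one-to-one mapping between code and name would produce. The report does not say which coefficient backs each matrix, so the sketch below uses Cramér's V (without bias correction) purely as an illustration of one common association measure for two categorical columns. The function `cramers_v` and the toy `independent` data are hypothetical; the `code_name_pairs` are copied from the first ten sample rows of this report.

```python
from collections import Counter
from math import sqrt

def cramers_v(pairs):
    """Cramér's V (no bias correction) for a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row = Counter(a for a, _ in pairs)
    col = Counter(b for _, b in pairs)
    chi2 = 0.0
    # Chi-squared statistic over every cell, including empty ones.
    for a, row_total in row.items():
        for b, col_total in col.items():
            expected = row_total * col_total / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# (센서 코드, 센서명) pairs from the first ten sample rows.
code_name_pairs = [
    (5, "부산진구"), (5, "부산진구"), (4, "영도구"), (15, "기장군"),
    (12, "연제구"), (6, "남구"), (4, "영도구"), (7, "북구"),
    (12, "연제구"), (5, "부산진구"),
]

# Toy data (not from the report): two fully crossed, independent factors.
independent = [(a, b) for a in "xy" for b in "pq"]

print(cramers_v(code_name_pairs))  # close to 1: each code maps to one name
print(cramers_v(independent))      # 0: no association
```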

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

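First rows

| Index | 날짜 및 시간 | 센서항목코드 | 센서항목 | 센서 코드 | 센서명 | 센서값 |
|---|---|---|---|---|---|---|
| 56806 | 2024-02-03 01:15:00 | 8192 | 온도 | 5 | 부산진구 | 6.579934 |
| 74565 | 2024-02-03 16:40:00 | 61440 | 기타 | 5 | 부산진구 | 1021045.0 |
| 75229 | 2024-02-03 17:15:00 | 28928 | 풍속 | 4 | 영도구 | 0.445113 |
| 90564 | 2024-02-04 06:35:00 | 28672 | 풍향 | 15 | 기장군 | 301.981995 |
| 58199 | 2024-02-03 02:30:00 | 8448 | 습도 | 12 | 연제구 | 70.484149 |
| 70061 | 2024-02-03 12:45:00 | 8448 | 습도 | 6 | 남구 | 68.928381 |
| 50941 | 2024-02-02 20:10:00 | 28928 | 풍속 | 4 | 영도구 | 0.60617 |
| 47410 | 2024-02-02 17:05:00 | 8192 | 온도 | 7 | 북구 | 8.222199 |
| 91220 | 2024-02-04 07:10:00 | 29696 | 기압 | 12 | 연제구 | 1025.160013 |
| 11779 | 2024-02-01 10:10:00 | 28928 | 풍속 | 5 | 부산진구 | 0.70029 |

Last rows

| Index | 날짜 및 시간 | 센서항목코드 | 센서항목 | 센서 코드 | 센서명 | 센서값 |
|---|---|---|---|---|---|---|
| 51181 | 2024-02-02 20:25:00 | 28928 | 풍속 | 11 | 강서구 | 0.360715 |
| 83716 | 2024-02-04 00:40:00 | 8192 | 온도 | 1 | 중구 | 6.604398 |
| 79074 | 2024-02-03 20:35:00 | 28672 | 풍향 | 5 | 부산진구 | 209.807999 |
| 70339 | 2024-02-03 13:00:00 | 28928 | 풍속 | 5 | 부산진구 | 1.087537 |
| 5826 | 2024-02-01 05:00:00 | 28672 | 풍향 | 5 | 부산진구 | 168.25 |
| 5338 | 2024-02-01 04:35:00 | 8192 | 온도 | 3 | 동구 | 9.705275 |
| 91890 | 2024-02-04 07:45:00 | 28672 | 풍향 | 12 | 연제구 | 246.098999 |
| 3843 | 2024-02-01 03:20:00 | 61440 | 기타 | 1 | 중구 | 30456620.0 |
| 7289 | 2024-02-01 06:15:00 | 8448 | 습도 | 8 | 해운대구 | 84.854605 |
| 48931 | 2024-02-02 18:25:00 | 28928 | 풍속 | 5 | 부산진구 | 0.659956 |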
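
The sample rows suggest that 센서값 pools readings from six sensor items on very different scales (wind speed below 1, pressure near 1025, 기타 in the millions). That would explain why the pooled mean (3632427.3) sits so far from the median (89.238141). The sketch below groups values by 센서항목 before summarizing. It uses only the first ten sample rows, so the ranges it prints are illustrative and not statistics of the full dataset.

```python
from collections import defaultdict

# (센서항목, 센서값) pairs copied from the first ten sample rows.
sample = [
    ("온도", 6.579934), ("기타", 1021045.0), ("풍속", 0.445113),
    ("풍향", 301.981995), ("습도", 70.484149), ("습도", 68.928381),
    ("풍속", 0.60617), ("온도", 8.222199), ("기압", 1025.160013),
    ("풍속", 0.70029),
]

# Group readings by sensor item so each scale is summarized on its own.
by_item = defaultdict(list)
for item, value in sample:
    by_item[item].append(value)

for item, values in by_item.items():
    print(item, len(values), min(values), max(values))
```

On the full data the same grouping could be applied per 센서항목 (or per 센서항목코드) before computing any summary statistics.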