Overview

Dataset statistics

Number of variables6
Number of observations27
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory56.9 B

Variable types

Numeric4
Text2

Dataset

Description고유번호,관측소명,주소,평균기온,X 좌표,Y 좌표
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-361/S/1/datasetView.do

Alerts

고유번호 is highly overall correlated with Y 좌표High correlation
평균기온 is highly overall correlated with Y 좌표High correlation
Y 좌표 is highly overall correlated with 고유번호 and 1 other fieldsHigh correlation
고유번호 has unique valuesUnique
관측소명 has unique valuesUnique
주소 has unique valuesUnique
평균기온 has unique valuesUnique
X 좌표 has unique valuesUnique
Y 좌표 has unique valuesUnique

Reproduction

Analysis started2023-12-11 08:45:30.552280
Analysis finished2023-12-11 08:45:32.830770
Duration2.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

고유번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14
Minimum1
Maximum27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-11T17:45:32.903707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.3
Q17.5
median14
Q320.5
95-th percentile25.7
Maximum27
Range26
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.9372539
Coefficient of variation (CV)0.56694671
Kurtosis-1.2
Mean14
Median Absolute Deviation (MAD)7
Skewness0
Sum378
Variance63
MonotonicityStrictly increasing
2023-12-11T17:45:33.025201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
1 1
 
3.7%
2 1
 
3.7%
27 1
 
3.7%
26 1
 
3.7%
25 1
 
3.7%
24 1
 
3.7%
23 1
 
3.7%
22 1
 
3.7%
21 1
 
3.7%
20 1
 
3.7%
Other values (17) 17
63.0%
ValueCountFrequency (%)
1 1
3.7%
2 1
3.7%
3 1
3.7%
4 1
3.7%
5 1
3.7%
6 1
3.7%
7 1
3.7%
8 1
3.7%
9 1
3.7%
10 1
3.7%
ValueCountFrequency (%)
27 1
3.7%
26 1
3.7%
25 1
3.7%
24 1
3.7%
23 1
3.7%
22 1
3.7%
21 1
3.7%
20 1
3.7%
19 1
3.7%
18 1
3.7%

관측소명
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-11T17:45:33.223426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length2.1851852
Min length2

Characters and Unicode

Total characters59
Distinct characters39
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row관악
2nd row금천
3rd row서초
4th row구로
5th row기상청
ValueCountFrequency (%)
관악 1
 
3.7%
강서 1
 
3.7%
도봉 1
 
3.7%
강북 1
 
3.7%
노원 1
 
3.7%
북한산 1
 
3.7%
성북 1
 
3.7%
은평 1
 
3.7%
중랑 1
 
3.7%
동대문 1
 
3.7%
Other values (17) 17
63.0%
2023-12-11T17:45:33.569798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
8.5%
4
 
6.8%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (29) 32
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
8.5%
4
 
6.8%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (29) 32
54.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
8.5%
4
 
6.8%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (29) 32
54.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
 
8.5%
4
 
6.8%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (29) 32
54.2%

주소
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-11T17:45:33.870402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length27.333333
Min length22

Characters and Unicode

Total characters738
Distinct characters129
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row서울특별시 관악구 신림동 산56-1 (서울대학교)
2nd row서울특별시 금천구 독산동 1034 (독산초등학교)
3rd row서울특별시 서초구 서초동 1650 (서울교육대학교)
4th row서울특별시 구로구 궁동 213-42 (수궁동사무소)
5th row서울특별시 동작구 신대방동 460-18 (기상청)
ValueCountFrequency (%)
서울특별시 27
 
19.9%
영등포구 2
 
1.5%
종로구 2
 
1.5%
신촌동 1
 
0.7%
551 1
 
0.7%
면목동 1
 
0.7%
중랑구 1
 
0.7%
서울시립대 1
 
0.7%
90 1
 
0.7%
전농동 1
 
0.7%
Other values (98) 98
72.1%
2023-12-11T17:45:34.353477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
109
 
14.8%
35
 
4.7%
35
 
4.7%
31
 
4.2%
31
 
4.2%
1 31
 
4.2%
28
 
3.8%
( 28
 
3.8%
27
 
3.7%
27
 
3.7%
Other values (119) 356
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 458
62.1%
Space Separator 109
 
14.8%
Decimal Number 99
 
13.4%
Open Punctuation 28
 
3.8%
Close Punctuation 27
 
3.7%
Dash Punctuation 17
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
7.6%
35
 
7.6%
31
 
6.8%
31
 
6.8%
28
 
6.1%
27
 
5.9%
27
 
5.9%
17
 
3.7%
15
 
3.3%
11
 
2.4%
Other values (105) 201
43.9%
Decimal Number
ValueCountFrequency (%)
1 31
31.3%
2 12
 
12.1%
4 10
 
10.1%
0 10
 
10.1%
3 9
 
9.1%
5 7
 
7.1%
6 6
 
6.1%
9 6
 
6.1%
8 5
 
5.1%
7 3
 
3.0%
Space Separator
ValueCountFrequency (%)
109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 458
62.1%
Common 280
37.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
7.6%
35
 
7.6%
31
 
6.8%
31
 
6.8%
28
 
6.1%
27
 
5.9%
27
 
5.9%
17
 
3.7%
15
 
3.3%
11
 
2.4%
Other values (105) 201
43.9%
Common
ValueCountFrequency (%)
109
38.9%
1 31
 
11.1%
( 28
 
10.0%
) 27
 
9.6%
- 17
 
6.1%
2 12
 
4.3%
4 10
 
3.6%
0 10
 
3.6%
3 9
 
3.2%
5 7
 
2.5%
Other values (4) 20
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 458
62.1%
ASCII 280
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
109
38.9%
1 31
 
11.1%
( 28
 
10.0%
) 27
 
9.6%
- 17
 
6.1%
2 12
 
4.3%
4 10
 
3.6%
0 10
 
3.6%
3 9
 
3.2%
5 7
 
2.5%
Other values (4) 20
 
7.1%
Hangul
ValueCountFrequency (%)
35
 
7.6%
35
 
7.6%
31
 
6.8%
31
 
6.8%
28
 
6.1%
27
 
5.9%
27
 
5.9%
17
 
3.7%
15
 
3.3%
11
 
2.4%
Other values (105) 201
43.9%

평균기온
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.13902289
Minimum-2.605182
Maximum1.547382
Zeros0
Zeros (%)0.0%
Negative10
Negative (%)37.0%
Memory size375.0 B
2023-12-11T17:45:34.528872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2.605182
5-th percentile-1.2383707
Q1-0.45235
median0.434751
Q30.7119615
95-th percentile1.0291336
Maximum1.547382
Range4.152564
Interquartile range (IQR)1.1643115

Descriptive statistics

Standard deviation0.90840888
Coefficient of variation (CV)6.5342397
Kurtosis1.7473685
Mean0.13902289
Median Absolute Deviation (MAD)0.442307
Skewness-1.1917192
Sum3.753618
Variance0.82520669
MonotonicityNot monotonic
2023-12-11T17:45:34.665670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
-1.065771 1
 
3.7%
1.547382 1
 
3.7%
0.038765 1
 
3.7%
-0.652582 1
 
3.7%
0.434751 1
 
3.7%
-0.941102 1
 
3.7%
-2.605182 1
 
3.7%
-0.413166 1
 
3.7%
-0.052481 1
 
3.7%
0.3828 1
 
3.7%
Other values (17) 17
63.0%
ValueCountFrequency (%)
-2.605182 1
3.7%
-1.312342 1
3.7%
-1.065771 1
3.7%
-0.941102 1
3.7%
-0.652582 1
3.7%
-0.525573 1
3.7%
-0.491534 1
3.7%
-0.413166 1
3.7%
-0.124186 1
3.7%
-0.052481 1
3.7%
ValueCountFrequency (%)
1.547382 1
3.7%
1.045957 1
3.7%
0.989879 1
3.7%
0.877058 1
3.7%
0.860559 1
3.7%
0.852671 1
3.7%
0.715912 1
3.7%
0.708011 1
3.7%
0.697345 1
3.7%
0.69474 1
3.7%

X 좌표
Real number (ℝ)

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean198523.02
Minimum185073.61
Maximum212927.44
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-11T17:45:34.815657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum185073.61
5-th percentile187338.22
Q1194070.45
median197826.73
Q3203753.29
95-th percentile208396.52
Maximum212927.44
Range27853.829
Interquartile range (IQR)9682.8375

Descriptive statistics

Standard deviation7018.1252
Coefficient of variation (CV)0.035351695
Kurtosis-0.53558425
Mean198523.02
Median Absolute Deviation (MAD)5094.497
Skewness0.087213408
Sum5360121.5
Variance49254081
MonotonicityNot monotonic
2023-12-11T17:45:34.953633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
195595.313 1
 
3.7%
192402.085 1
 
3.7%
196776.981 1
 
3.7%
202921.232 1
 
3.7%
199965.82 1
 
3.7%
207707.332 1
 
3.7%
195975.826 1
 
3.7%
199754.524 1
 
3.7%
194139.26 1
 
3.7%
207666.945 1
 
3.7%
Other values (17) 17
63.0%
ValueCountFrequency (%)
185073.607 1
3.7%
186562.686 1
3.7%
189147.812 1
3.7%
191684.361 1
3.7%
192402.085 1
3.7%
192989.465 1
3.7%
194001.636 1
3.7%
194139.26 1
3.7%
194459.567 1
3.7%
195098.983 1
3.7%
ValueCountFrequency (%)
212927.436 1
3.7%
208691.884 1
3.7%
207707.332 1
3.7%
207666.945 1
3.7%
206805.801 1
3.7%
204240.829 1
3.7%
204103.359 1
3.7%
203403.212 1
3.7%
202921.232 1
3.7%
201584.355 1
3.7%

Y 좌표
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean450169.26
Minimum439273.75
Maximum462938.81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-11T17:45:35.119497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum439273.75
5-th percentile440851.15
Q1446270.19
median449810.32
Q3453734.97
95-th percentile459147.21
Maximum462938.81
Range23665.057
Interquartile range (IQR)7464.776

Descriptive statistics

Standard deviation6014.0389
Coefficient of variation (CV)0.013359506
Kurtosis-0.45562969
Mean450169.26
Median Absolute Deviation (MAD)3950.311
Skewness0.18426658
Sum12154570
Variance36168663
MonotonicityNot monotonic
2023-12-11T17:45:35.305995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
439273.753 1
 
3.7%
440102.985 1
 
3.7%
452738.5 1
 
3.7%
462938.81 1
 
3.7%
459608.887 1
 
3.7%
458069.979 1
 
3.7%
457634.403 1
 
3.7%
457078.64 1
 
3.7%
456847.603 1
 
3.7%
454085.874 1
 
3.7%
Other values (17) 17
63.0%
ValueCountFrequency (%)
439273.753 1
3.7%
440102.985 1
3.7%
442596.883 1
3.7%
442976.2 1
3.7%
444098.001 1
3.7%
445730.587 1
3.7%
445860.012 1
3.7%
446680.368 1
3.7%
447181.371 1
3.7%
447486.565 1
3.7%
ValueCountFrequency (%)
462938.81 1
3.7%
459608.887 1
3.7%
458069.979 1
3.7%
457634.403 1
3.7%
457078.64 1
3.7%
456847.603 1
3.7%
454085.874 1
3.7%
453384.058 1
3.7%
452738.5 1
3.7%
452319.052 1
3.7%

Interactions

2023-12-11T17:45:31.990038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:30.849958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.220422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.563317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:32.341666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:30.950458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.299191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.664682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:32.418996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.042652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.366959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.773960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:32.535632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.136787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.458456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:45:31.897980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T17:45:35.429061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호관측소명주소평균기온X 좌표Y 좌표
고유번호1.0001.0001.0000.2020.0000.924
관측소명1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
평균기온0.2021.0001.0001.0000.0000.532
X 좌표0.0001.0001.0000.0001.0000.000
Y 좌표0.9241.0001.0000.5320.0001.000
2023-12-11T17:45:35.563856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호평균기온X 좌표Y 좌표
고유번호1.000-0.4950.3020.978
평균기온-0.4951.000-0.015-0.501
X 좌표0.302-0.0151.0000.326
Y 좌표0.978-0.5010.3261.000

Missing values

2023-12-11T17:45:32.662909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T17:45:32.789593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

고유번호관측소명주소평균기온X 좌표Y 좌표
01관악서울특별시 관악구 신림동 산56-1 (서울대학교)-1.065771195595.313439273.753
12금천서울특별시 금천구 독산동 1034 (독산초등학교)1.547382192402.085440102.985
23서초서울특별시 서초구 서초동 1650 (서울교육대학교)0.989879201584.355442596.883
34구로서울특별시 구로구 궁동 213-42 (수궁동사무소)-0.124186185073.607442976.2
45기상청서울특별시 동작구 신대방동 460-18 (기상청)0.653926192989.465444098.001
56송파서울특별시 송파구 잠실동 40-1 (롯데월드)1.045957208691.884445730.587
67강남서울특별시 강남구 삼성동 42 (삼릉초등학교)0.69474204103.359445860.012
78용산서울특별시 용산구 이촌동 301-75 (신용산초등학교)0.697345197826.735446680.368
89한강서울특별시 영등포구 여의도동 85-1 (세모유람선)0.632861194459.567447181.371
910양천서울특별시 양천구 목동 915 (목동주차장)0.708011189147.812447486.565
고유번호관측소명주소평균기온X 좌표Y 좌표
1718서대문서울특별시 서대문구 신촌동 134 (연세대학교)-0.525573195098.983452319.052
1819동대문서울특별시 동대문구 전농동 90 (서울시립대)0.877058204240.829453384.058
1920중랑서울특별시 중랑구 면목동 551 (면동초등학교)0.3828207666.945454085.874
2021은평서울특별시 은평구 불광동 280-17 (국립환경연구원)-0.052481194139.26456847.603
2122성북서울특별시 성북구 정릉동 861-1 (국민대학교)-0.413166199754.524457078.64
2223북한산서울특별시 종로구 구기동 산1 (승가사)-2.605182195975.826457634.403
2324노원서울특별시 노원구 공릉동 230-3 (육군사관학교)-0.941102207707.332458069.979
2425강북서울특별시 강북구 수유동 192-49 (강북구청 본관)0.434751199965.82459608.887
2526도봉서울특별시 도봉구 방학동 310 (신방학초등학교)-0.652582202921.232462938.81
2627서울서울특별시 종로구 송월동 1번지 (서울기상대)0.038765196776.981452738.5