Overview

Dataset statistics

Number of variables5
Number of observations112
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory43.2 B

Variable types

Text2
Numeric2
Categorical1

Dataset

Description해당 데이터는 대구광역시 서구 관내의 공원 및 녹지 내 야외 운동기구의 위치 및 종류등의 항목을 포함한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15101364/fileData.do

Alerts

체육기구(점) is highly overall correlated with 면 적(제곱미터)High correlation
면 적(제곱미터) is highly overall correlated with 체육기구(점)High correlation

Reproduction

Analysis started2023-12-12 06:28:34.310383
Analysis finished2023-12-12 06:28:35.095783
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치
Text

Distinct110
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T15:28:35.358383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length8.5625
Min length3

Characters and Unicode

Total characters959
Distinct characters152
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)97.3%

Sample

1st row감삼못공원
2nd row평리공원
3rd row이현공원 체력단련장
4th row이현공원 서편
5th row가르뱅이공원
ValueCountFrequency (%)
철로변완충녹지 24
 
14.0%
염색공단 6
 
3.5%
어린이놀이터 4
 
2.3%
달서천로 4
 
2.3%
문화원 3
 
1.7%
2
 
1.2%
2
 
1.2%
내당2.3동 2
 
1.2%
와룡산 2
 
1.2%
교통섬 2
 
1.2%
Other values (119) 121
70.3%
2023-12-12T15:28:35.834240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60
 
6.3%
57
 
5.9%
51
 
5.3%
50
 
5.2%
) 37
 
3.9%
( 37
 
3.9%
33
 
3.4%
32
 
3.3%
30
 
3.1%
30
 
3.1%
Other values (142) 542
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 762
79.5%
Space Separator 60
 
6.3%
Decimal Number 59
 
6.2%
Close Punctuation 37
 
3.9%
Open Punctuation 37
 
3.9%
Other Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
7.5%
51
 
6.7%
50
 
6.6%
33
 
4.3%
32
 
4.2%
30
 
3.9%
30
 
3.9%
28
 
3.7%
26
 
3.4%
24
 
3.1%
Other values (128) 401
52.6%
Decimal Number
ValueCountFrequency (%)
1 14
23.7%
2 12
20.3%
3 9
15.3%
4 8
13.6%
6 5
 
8.5%
5 4
 
6.8%
7 3
 
5.1%
8 2
 
3.4%
0 1
 
1.7%
9 1
 
1.7%
Space Separator
ValueCountFrequency (%)
60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 762
79.5%
Common 197
 
20.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
7.5%
51
 
6.7%
50
 
6.6%
33
 
4.3%
32
 
4.2%
30
 
3.9%
30
 
3.9%
28
 
3.7%
26
 
3.4%
24
 
3.1%
Other values (128) 401
52.6%
Common
ValueCountFrequency (%)
60
30.5%
) 37
18.8%
( 37
18.8%
1 14
 
7.1%
2 12
 
6.1%
3 9
 
4.6%
4 8
 
4.1%
6 5
 
2.5%
5 4
 
2.0%
. 4
 
2.0%
Other values (4) 7
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 762
79.5%
ASCII 197
 
20.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
60
30.5%
) 37
18.8%
( 37
18.8%
1 14
 
7.1%
2 12
 
6.1%
3 9
 
4.6%
4 8
 
4.1%
6 5
 
2.5%
5 4
 
2.0%
. 4
 
2.0%
Other values (4) 7
 
3.6%
Hangul
ValueCountFrequency (%)
57
 
7.5%
51
 
6.7%
50
 
6.6%
33
 
4.3%
32
 
4.2%
30
 
3.9%
30
 
3.9%
28
 
3.7%
26
 
3.4%
24
 
3.1%
Other values (128) 401
52.6%

주소
Text

Distinct110
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T15:28:36.158025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length18.357143
Min length15

Characters and Unicode

Total characters2056
Distinct characters34
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)96.4%

Sample

1st row대구광역시 서구 내당동 463-7
2nd row대구광역시 서구 평리동 1230-1
3rd row대구광역시 서구 이현동 산28-15
4th row대구광역시 서구 이현동 산28-18
5th row대구광역시 서구 상리동 산264-5
ValueCountFrequency (%)
대구광역시 112
25.0%
서구 112
25.0%
비산동 46
10.3%
평리동 21
 
4.7%
중리동 11
 
2.5%
내당동 10
 
2.2%
이현동 8
 
1.8%
원대동1가 6
 
1.3%
상리동 6
 
1.3%
원대동3가 3
 
0.7%
Other values (111) 113
25.2%
2023-12-12T15:28:36.641435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
336
16.3%
224
 
10.9%
121
 
5.9%
112
 
5.4%
112
 
5.4%
112
 
5.4%
112
 
5.4%
111
 
5.4%
1 105
 
5.1%
- 91
 
4.4%
Other values (24) 620
30.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1140
55.4%
Decimal Number 489
23.8%
Space Separator 336
 
16.3%
Dash Punctuation 91
 
4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
224
19.6%
121
10.6%
112
9.8%
112
9.8%
112
9.8%
112
9.8%
111
9.7%
56
 
4.9%
46
 
4.0%
38
 
3.3%
Other values (12) 96
8.4%
Decimal Number
ValueCountFrequency (%)
1 105
21.5%
2 60
12.3%
0 45
9.2%
8 44
9.0%
3 43
8.8%
4 42
 
8.6%
7 42
 
8.6%
5 39
 
8.0%
6 38
 
7.8%
9 31
 
6.3%
Space Separator
ValueCountFrequency (%)
336
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 91
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1140
55.4%
Common 916
44.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
224
19.6%
121
10.6%
112
9.8%
112
9.8%
112
9.8%
112
9.8%
111
9.7%
56
 
4.9%
46
 
4.0%
38
 
3.3%
Other values (12) 96
8.4%
Common
ValueCountFrequency (%)
336
36.7%
1 105
 
11.5%
- 91
 
9.9%
2 60
 
6.6%
0 45
 
4.9%
8 44
 
4.8%
3 43
 
4.7%
4 42
 
4.6%
7 42
 
4.6%
5 39
 
4.3%
Other values (2) 69
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1140
55.4%
ASCII 916
44.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
336
36.7%
1 105
 
11.5%
- 91
 
9.9%
2 60
 
6.6%
0 45
 
4.9%
8 44
 
4.8%
3 43
 
4.7%
4 42
 
4.6%
7 42
 
4.6%
5 39
 
4.3%
Other values (2) 69
 
7.5%
Hangul
ValueCountFrequency (%)
224
19.6%
121
10.6%
112
9.8%
112
9.8%
112
9.8%
112
9.8%
111
9.7%
56
 
4.9%
46
 
4.0%
38
 
3.3%
Other values (12) 96
8.4%

체육기구(점)
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.0178571
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T15:28:36.777652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q39
95-th percentile14
Maximum24
Range23
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.924931
Coefficient of variation (CV)0.55927771
Kurtosis2.7657523
Mean7.0178571
Median Absolute Deviation (MAD)2
Skewness1.2534649
Sum786
Variance15.405084
MonotonicityNot monotonic
2023-12-12T15:28:36.951283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6 15
13.4%
4 15
13.4%
5 11
9.8%
7 11
9.8%
9 11
9.8%
3 10
8.9%
10 8
7.1%
8 6
 
5.4%
11 6
 
5.4%
2 4
 
3.6%
Other values (8) 15
13.4%
ValueCountFrequency (%)
1 4
 
3.6%
2 4
 
3.6%
3 10
8.9%
4 15
13.4%
5 11
9.8%
6 15
13.4%
7 11
9.8%
8 6
 
5.4%
9 11
9.8%
10 8
7.1%
ValueCountFrequency (%)
24 1
 
0.9%
19 1
 
0.9%
17 1
 
0.9%
16 1
 
0.9%
14 3
 
2.7%
13 3
 
2.7%
12 1
 
0.9%
11 6
5.4%
10 8
7.1%
9 11
9.8%

면 적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)13.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.285714
Minimum5
Maximum80
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T15:28:37.091100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile10
Q120
median25
Q335
95-th percentile56.7
Maximum80
Range75
Interquartile range (IQR)15

Descriptive statistics

Standard deviation13.637144
Coefficient of variation (CV)0.4997906
Kurtosis2.5887453
Mean27.285714
Median Absolute Deviation (MAD)7.5
Skewness1.2983365
Sum3056
Variance185.97169
MonotonicityNot monotonic
2023-12-12T15:28:37.237178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
25 21
18.8%
20 18
16.1%
35 17
15.2%
30 17
15.2%
15 15
13.4%
10 6
 
5.4%
40 4
 
3.6%
5 4
 
3.6%
66 2
 
1.8%
50 2
 
1.8%
Other values (5) 6
 
5.4%
ValueCountFrequency (%)
5 4
 
3.6%
10 6
 
5.4%
15 15
13.4%
20 18
16.1%
25 21
18.8%
30 17
15.2%
35 17
15.2%
40 4
 
3.6%
45 1
 
0.9%
50 2
 
1.8%
ValueCountFrequency (%)
80 1
 
0.9%
66 2
 
1.8%
65 2
 
1.8%
60 1
 
0.9%
54 1
 
0.9%
50 2
 
1.8%
45 1
 
0.9%
40 4
 
3.6%
35 17
15.2%
30 17
15.2%

비 고
Categorical

Distinct11
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
철로변 완충녹지
24 
쌈지공원
17 
어린이공원
15 
어린이놀이터 및 공공공지
12 
서대구공단 완충녹지
11 
Other values (6)
33 

Length

Max length13
Median length9
Mean length7.1339286
Min length3

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row근린공원
2nd row근린공원
3rd row근린공원
4th row근린공원
5th row근린공원

Common Values

ValueCountFrequency (%)
철로변 완충녹지 24
21.4%
쌈지공원 17
15.2%
어린이공원 15
13.4%
어린이놀이터 및 공공공지 12
10.7%
서대구공단 완충녹지 11
9.8%
동네체육시설 8
 
7.1%
근린공원 7
 
6.2%
교내 체육시설 7
 
6.2%
염색공단 완충녹지 6
 
5.4%
시설녹지 4
 
3.6%

Length

2023-12-12T15:28:37.415185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
완충녹지 41
22.3%
철로변 24
13.0%
쌈지공원 17
9.2%
어린이공원 15
 
8.2%
어린이놀이터 12
 
6.5%
12
 
6.5%
공공공지 12
 
6.5%
서대구공단 11
 
6.0%
동네체육시설 8
 
4.3%
근린공원 7
 
3.8%
Other values (5) 25
13.6%

Interactions

2023-12-12T15:28:34.720725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:28:34.530045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:28:34.831080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:28:34.614470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:28:37.513278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체육기구(점)면 적(제곱미터)비 고
체육기구(점)1.0000.7840.567
면 적(제곱미터)0.7841.0000.711
비 고0.5670.7111.000
2023-12-12T15:28:37.653404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체육기구(점)면 적(제곱미터)비 고
체육기구(점)1.0000.7820.295
면 적(제곱미터)0.7821.0000.393
비 고0.2950.3931.000

Missing values

2023-12-12T15:28:34.950172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:28:35.055402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위치주소체육기구(점)면 적(제곱미터)비 고
0감삼못공원대구광역시 서구 내당동 463-71466근린공원
1평리공원대구광역시 서구 평리동 1230-11650근린공원
2이현공원 체력단련장대구광역시 서구 이현동 산28-152460근린공원
3이현공원 서편대구광역시 서구 이현동 산28-181054근린공원
4가르뱅이공원대구광역시 서구 상리동 산264-5625근린공원
5상리공원 광장대구광역시 서구 중리동 1180-41066근린공원
6상리공원 정상대구광역시 서구 중리동 산1981035근린공원
7황제공원대구광역시 서구 내당동 11-141030어린이공원
8경운공원대구광역시 서구 내당동 252-4835어린이공원
9삼익공원대구광역시 서구 내당동 308-11025어린이공원
위치주소체육기구(점)면 적(제곱미터)비 고
102누리쌈지공원대구광역시 서구 비산동 368-1115쌈지공원
103다솜쌈지공원대구광역시 서구 비산동 437-1315쌈지공원
104효사각쌈지공원대구광역시 서구 원대동3가 1300-1625쌈지공원
105꽃담쌈지공원대구광역시 서구 내당동 1015-67415쌈지공원
106새방골쌈지공원대구광역시 서구 상리동 591-1110쌈지공원
107평리1동쌈지공원대구광역시 서구 평리동 844-26310쌈지공원
108비산4동쌈지공원대구광역시 서구 비산동 315-187210쌈지공원
109내당2.3동쌈지공원대구광역시 서구 내당동 891-37415쌈지공원
110비산4동쌈지공원(2)대구광역시 서구 비산동 293-38315쌈지공원
111인동촌 쌈지공원대구광역시 서구 비산동 22-8520쌈지공원