Overview

Dataset statistics

Number of variables5
Number of observations144
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory43.9 B

Variable types

Numeric2
Categorical3

Dataset

Description대전광역시 서구 야외운동기구에 대한 데이터로 야외운동기구의 설치위치, 종류, 설치대수, 설치년도에 대한 정보를 제공합니다.
Author대전광역시 서구
URLhttps://www.data.go.kr/data/15067228/fileData.do

Alerts

연번 is highly overall correlated with 설치년도 and 1 other fieldsHigh correlation
설치년도 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
설치위치 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
종류 is highly overall correlated with 설치대수High correlation
설치대수 is highly overall correlated with 종류High correlation
설치대수 is highly imbalanced (65.9%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 15:26:16.937961
Analysis finished2024-03-14 15:26:18.400241
Duration1.46 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct144
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.5
Minimum1
Maximum144
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-15T00:26:18.617349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.15
Q136.75
median72.5
Q3108.25
95-th percentile136.85
Maximum144
Range143
Interquartile range (IQR)71.5

Descriptive statistics

Standard deviation41.713307
Coefficient of variation (CV)0.57535596
Kurtosis-1.2
Mean72.5
Median Absolute Deviation (MAD)36
Skewness0
Sum10440
Variance1740
MonotonicityStrictly increasing
2024-03-15T00:26:19.306651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
74 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
Other values (134) 134
93.1%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%
135 1
0.7%

설치위치
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)20.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
복수동 산19-4
11 
가수원동 157-1
10 
용촌동 575-3
 
8
산직동 74-11
 
8
우명동 251-1
 
8
Other values (24)
99 

Length

Max length11
Median length10
Mean length8.5138889
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row복수동 산19-4
2nd row복수동 산19-4
3rd row복수동 산19-4
4th row복수동 산19-4
5th row복수동 산19-4

Common Values

ValueCountFrequency (%)
복수동 산19-4 11
 
7.6%
가수원동 157-1 10
 
6.9%
용촌동 575-3 8
 
5.6%
산직동 74-11 8
 
5.6%
우명동 251-1 8
 
5.6%
흑석동 621-1 7
 
4.9%
정림동 290-3 6
 
4.2%
평촌2동 683-13 6
 
4.2%
용촌동 330-1 6
 
4.2%
봉곡동 519 6
 
4.2%
Other values (19) 68
47.2%

Length

2024-03-15T00:26:19.796329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용촌동 18
 
6.2%
정림동 14
 
4.9%
산직동 12
 
4.2%
월평동 12
 
4.2%
우명동 12
 
4.2%
복수동 11
 
3.8%
산19-4 11
 
3.8%
괴곡동 10
 
3.5%
157-1 10
 
3.5%
가수원동 10
 
3.5%
Other values (36) 168
58.3%

종류
Categorical

HIGH CORRELATION 

Distinct43
Distinct (%)29.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
허리돌리기
17 
하늘걷기
11 
에어서핑
 
8
에어워킹
 
7
달리기운동
 
7
Other values (38)
94 

Length

Max length7
Median length6
Mean length4.7222222
Min length3

Unique

Unique16 ?
Unique (%)11.1%

Sample

1st row에어서핑
2nd row체스트프레스
3rd row트윈트위스트
4th row어깨돌리기
5th row마사지롤라

Common Values

ValueCountFrequency (%)
허리돌리기 17
 
11.8%
하늘걷기 11
 
7.6%
에어서핑 8
 
5.6%
에어워킹 7
 
4.9%
달리기운동 7
 
4.9%
트위스트 6
 
4.2%
워밍쇼올더 6
 
4.2%
파도타기 6
 
4.2%
마사지롤라 6
 
4.2%
노르딕머신 5
 
3.5%
Other values (33) 65
45.1%

Length

2024-03-15T00:26:20.242840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
허리돌리기 17
 
11.8%
하늘걷기 11
 
7.6%
에어서핑 8
 
5.6%
달리기운동 7
 
4.9%
에어워킹 7
 
4.9%
트위스트 6
 
4.2%
워밍쇼올더 6
 
4.2%
파도타기 6
 
4.2%
마사지롤라 6
 
4.2%
어깨풀기 5
 
3.5%
Other values (33) 65
45.1%

설치대수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
1
129 
2
13 
3
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row2
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 129
89.6%
2 13
 
9.0%
3 2
 
1.4%

Length

2024-03-15T00:26:20.653935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:26:20.977875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 129
89.6%
2 13
 
9.0%
3 2
 
1.4%

설치년도
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.1875
Minimum1996
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-15T00:26:21.289359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1996
5-th percentile1996
Q12011
median2014
Q32017
95-th percentile2020.7
Maximum2021
Range25
Interquartile range (IQR)6

Descriptive statistics

Standard deviation6.3366718
Coefficient of variation (CV)0.0031475815
Kurtosis1.7504378
Mean2013.1875
Median Absolute Deviation (MAD)3
Skewness-1.469081
Sum289899
Variance40.153409
MonotonicityNot monotonic
2024-03-15T00:26:21.657948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2016 34
23.6%
2014 24
16.7%
2019 22
15.3%
1996 11
 
7.6%
2011 11
 
7.6%
2005 8
 
5.6%
2013 8
 
5.6%
2021 8
 
5.6%
2017 7
 
4.9%
2007 6
 
4.2%
Other values (2) 5
 
3.5%
ValueCountFrequency (%)
1996 11
 
7.6%
2005 8
 
5.6%
2007 6
 
4.2%
2009 3
 
2.1%
2011 11
 
7.6%
2012 2
 
1.4%
2013 8
 
5.6%
2014 24
16.7%
2016 34
23.6%
2017 7
 
4.9%
ValueCountFrequency (%)
2021 8
 
5.6%
2019 22
15.3%
2017 7
 
4.9%
2016 34
23.6%
2014 24
16.7%
2013 8
 
5.6%
2012 2
 
1.4%
2011 11
 
7.6%
2009 3
 
2.1%
2007 6
 
4.2%

Interactions

2024-03-15T00:26:17.742741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:26:17.247934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:26:17.895699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:26:17.505121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T00:26:21.891287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치위치종류설치대수설치년도
연번1.0000.9890.7570.4930.906
설치위치0.9891.0000.0000.6891.000
종류0.7570.0001.0000.9230.623
설치대수0.4930.6890.9231.0000.213
설치년도0.9061.0000.6230.2131.000
2024-03-15T00:26:22.057728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류설치위치설치대수
종류1.0000.0000.654
설치위치0.0001.0000.419
설치대수0.6540.4191.000
2024-03-15T00:26:22.207544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치년도설치위치종류설치대수
연번1.0000.8240.8470.3240.331
설치년도0.8241.0000.9200.2750.289
설치위치0.8470.9201.0000.0000.419
종류0.3240.2750.0001.0000.654
설치대수0.3310.2890.4190.6541.000

Missing values

2024-03-15T00:26:18.083647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:26:18.260511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번설치위치종류설치대수설치년도
01복수동 산19-4에어서핑11996
12복수동 산19-4체스트프레스11996
23복수동 산19-4트윈트위스트21996
34복수동 산19-4어깨돌리기21996
45복수동 산19-4마사지롤라11996
56복수동 산19-4에어워킹11996
67복수동 산19-4노르딕머신11996
78복수동 산19-4로우잉머신11996
89복수동 산19-4싸이클링31996
910복수동 산19-4스쿼트머신31996
연번설치위치종류설치대수설치년도
134135평촌2동 683-13공중걷기12019
135136평촌2동 683-13달리기운동12019
136137도마동 210공중걷기12021
137138도마동 210다리뻗치기12021
138139도마동 210허리돌리기12021
139140도마동 210큰활차12021
140141용촌동 330-1오금펴기12021
141142용촌동 330-1하늘걷기12021
142143월평동 60-1파도타기12021
143144월평동 60-1하늘걷기12021