Overview

Dataset statistics

Number of variables3
Number of observations24
Missing cells2
Missing cells (%)2.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory732.0 B
Average record size in memory30.5 B

Variable types

DateTime1
Categorical1
Numeric1

Dataset

Description경주시시설관리공단에서 운영하고 있는 경주국민체육센터의 월별 이용객 수 입니다.(2021년 1월부터 2022년 11월까지 포함하고 있습니다.)
Author경주시시설관리공단
URLhttps://www.data.go.kr/data/15095668/fileData.do

Alerts

이용객수 is highly overall correlated with 시설물High correlation
시설물 is highly overall correlated with 이용객수 High correlation
시설물 is highly imbalanced (75.0%)Imbalance
구분 has 1 (4.2%) missing valuesMissing
이용객수 has 1 (4.2%) missing valuesMissing
이용객수 has 5 (20.8%) zerosZeros

Reproduction

Analysis started2023-12-12 00:44:17.120817
Analysis finished2023-12-12 00:44:17.682499
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Date

MISSING 

Distinct23
Distinct (%)100.0%
Missing1
Missing (%)4.2%
Memory size324.0 B
Minimum2021-01-21 00:00:00
Maximum2022-11-22 00:00:00
2023-12-12T09:44:17.722923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:44:17.808460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)

시설물
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
경주국민체육센터
23 
<NA>
 
1

Length

Max length8
Median length8
Mean length7.8333333
Min length4

Unique

Unique1 ?
Unique (%)4.2%

Sample

1st row경주국민체육센터
2nd row경주국민체육센터
3rd row경주국민체육센터
4th row경주국민체육센터
5th row경주국민체육센터

Common Values

ValueCountFrequency (%)
경주국민체육센터 23
95.8%
<NA> 1
 
4.2%

Length

2023-12-12T09:44:17.913242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:44:18.007539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경주국민체육센터 23
95.8%
na 1
 
4.2%

이용객수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct19
Distinct (%)82.6%
Missing1
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean7351.6522
Minimum0
Maximum17680
Zeros5
Zeros (%)20.8%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T09:44:18.082971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11755
median7298
Q311677
95-th percentile16890.6
Maximum17680
Range17680
Interquartile range (IQR)9922

Descriptive statistics

Standard deviation6169.4475
Coefficient of variation (CV)0.83919198
Kurtosis-1.2365116
Mean7351.6522
Median Absolute Deviation (MAD)5353
Skewness0.32901856
Sum169088
Variance38062083
MonotonicityNot monotonic
2023-12-12T09:44:18.177484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 5
20.8%
1565 1
 
4.2%
16985 1
 
4.2%
16041 1
 
4.2%
15780 1
 
4.2%
17680 1
 
4.2%
15801 1
 
4.2%
11709 1
 
4.2%
10313 1
 
4.2%
5769 1
 
4.2%
Other values (9) 9
37.5%
ValueCountFrequency (%)
0 5
20.8%
1565 1
 
4.2%
1945 1
 
4.2%
2535 1
 
4.2%
4182 1
 
4.2%
5581 1
 
4.2%
5769 1
 
4.2%
7298 1
 
4.2%
7733 1
 
4.2%
8018 1
 
4.2%
ValueCountFrequency (%)
17680 1
4.2%
16985 1
4.2%
16041 1
4.2%
15801 1
4.2%
15780 1
4.2%
11709 1
4.2%
11645 1
4.2%
10313 1
4.2%
8508 1
4.2%
8018 1
4.2%

Interactions

2023-12-12T09:44:17.199011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:44:18.261373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분이용객수
구분1.0001.000
이용객수1.0001.000
2023-12-12T09:44:18.331324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이용객수시설물
이용객수1.0001.000
시설물1.0001.000

Missing values

2023-12-12T09:44:17.307379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:44:17.365978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:44:17.643960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분시설물이용객수
02021-01-21경주국민체육센터0
12021-02-21경주국민체육센터1565
22021-03-21경주국민체육센터7298
32021-04-21경주국민체육센터8018
42021-05-21경주국민체육센터5581
52021-06-21경주국민체육센터8508
62021-07-21경주국민체육센터11645
72021-08-21경주국민체육센터1945
82021-09-21경주국민체육센터0
92021-10-21경주국민체육센터2535
구분시설물이용객수
142022-03-22경주국민체육센터0
152022-04-22경주국민체육센터5769
162022-05-22경주국민체육센터10313
172022-06-22경주국민체육센터11709
182022-07-22경주국민체육센터15801
192022-08-22경주국민체육센터17680
202022-09-22경주국민체육센터15780
212022-10-22경주국민체육센터16041
222022-11-22경주국민체육센터16985
23<NA><NA><NA>