Overview

Dataset statistics

Number of variables9
Number of observations66
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory77.0 B

Variable types

Numeric2
Categorical3
Text1
Boolean2
DateTime1

Dataset

Description대구공공시설관리공단 체육시설 매출정보입니다.순번, 센터코드, 품목명, 판매금액, 요금적용가능요일, 일일매표단체상품최소기준인원, 사용여부, 강좌자유이용가능여부, 등록일시로 구성되어있습니다.
Author대구공공시설관리공단
URLhttps://www.data.go.kr/data/15120506/fileData.do

Alerts

강좌자유이용가능여부 has constant value ""Constant
순번 is highly overall correlated with 일일매표단체상품최소기준인원High correlation
센터코드 is highly overall correlated with 요금적용가능요일High correlation
요금적용가능요일 is highly overall correlated with 센터코드High correlation
일일매표단체상품최소기준인원 is highly overall correlated with 순번High correlation
일일매표단체상품최소기준인원 is highly imbalanced (55.8%)Imbalance
사용여부 is highly imbalanced (73.3%)Imbalance
순번 has unique valuesUnique
품목명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:10:04.442390
Analysis finished2023-12-12 02:10:05.519279
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.5
Minimum1
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T11:10:05.634436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.25
Q117.25
median33.5
Q349.75
95-th percentile62.75
Maximum66
Range65
Interquartile range (IQR)32.5

Descriptive statistics

Standard deviation19.196354
Coefficient of variation (CV)0.57302549
Kurtosis-1.2
Mean33.5
Median Absolute Deviation (MAD)16.5
Skewness0
Sum2211
Variance368.5
MonotonicityStrictly increasing
2023-12-12T11:10:05.778569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
51 1
 
1.5%
37 1
 
1.5%
38 1
 
1.5%
39 1
 
1.5%
40 1
 
1.5%
41 1
 
1.5%
42 1
 
1.5%
43 1
 
1.5%
44 1
 
1.5%
Other values (56) 56
84.8%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
66 1
1.5%
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%
58 1
1.5%
57 1
1.5%

센터코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size660.0 B
서재문화체육센터
37 
두류수영장
21 
대구실내빙상장

Length

Max length8
Median length8
Mean length6.9242424
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row두류수영장
2nd row대구실내빙상장
3rd row두류수영장
4th row두류수영장
5th row두류수영장

Common Values

ValueCountFrequency (%)
서재문화체육센터 37
56.1%
두류수영장 21
31.8%
대구실내빙상장 8
 
12.1%

Length

2023-12-12T11:10:05.928753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:10:06.053039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서재문화체육센터 37
56.1%
두류수영장 21
31.8%
대구실내빙상장 8
 
12.1%

품목명
Text

UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T11:10:06.302814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length10.621212
Min length2

Characters and Unicode

Total characters701
Distinct characters80
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)100.0%

Sample

1st row자유수영(성인)8회
2nd row정규반[피겨]-토
3rd row자유수영(청소년)8회
4th row자유수영(어린이/경로)8회
5th row자유수영(성인)_3회
ValueCountFrequency (%)
경노 4
 
4.3%
성인 4
 
4.3%
배드민턴 3
 
3.2%
자유수영(5회 3
 
3.2%
단체 3
 
3.2%
주2회 2
 
2.2%
청소년_군인_여자 2
 
2.2%
청소년 2
 
2.2%
자유수영(6회 2
 
2.2%
주3회 2
 
2.2%
Other values (58) 66
71.0%
2023-12-12T11:10:06.755439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 47
 
6.7%
) 47
 
6.7%
39
 
5.6%
34
 
4.9%
31
 
4.4%
30
 
4.3%
_ 28
 
4.0%
27
 
3.9%
25
 
3.6%
24
 
3.4%
Other values (70) 369
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 462
65.9%
Decimal Number 59
 
8.4%
Open Punctuation 53
 
7.6%
Close Punctuation 53
 
7.6%
Connector Punctuation 28
 
4.0%
Space Separator 27
 
3.9%
Other Punctuation 10
 
1.4%
Dash Punctuation 6
 
0.9%
Math Symbol 2
 
0.3%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
8.4%
34
 
7.4%
31
 
6.7%
30
 
6.5%
25
 
5.4%
24
 
5.2%
16
 
3.5%
14
 
3.0%
13
 
2.8%
12
 
2.6%
Other values (49) 224
48.5%
Decimal Number
ValueCountFrequency (%)
5 13
22.0%
3 12
20.3%
1 11
18.6%
8 7
11.9%
2 6
10.2%
6 5
 
8.5%
0 3
 
5.1%
4 1
 
1.7%
9 1
 
1.7%
Other Punctuation
ValueCountFrequency (%)
/ 4
40.0%
. 3
30.0%
% 3
30.0%
Open Punctuation
ValueCountFrequency (%)
( 47
88.7%
[ 6
 
11.3%
Close Punctuation
ValueCountFrequency (%)
) 47
88.7%
] 6
 
11.3%
Connector Punctuation
ValueCountFrequency (%)
_ 28
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
P 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 462
65.9%
Common 238
34.0%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
8.4%
34
 
7.4%
31
 
6.7%
30
 
6.5%
25
 
5.4%
24
 
5.2%
16
 
3.5%
14
 
3.0%
13
 
2.8%
12
 
2.6%
Other values (49) 224
48.5%
Common
ValueCountFrequency (%)
( 47
19.7%
) 47
19.7%
_ 28
11.8%
27
11.3%
5 13
 
5.5%
3 12
 
5.0%
1 11
 
4.6%
8 7
 
2.9%
- 6
 
2.5%
2 6
 
2.5%
Other values (10) 34
14.3%
Latin
ValueCountFrequency (%)
P 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 462
65.9%
ASCII 239
34.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 47
19.7%
) 47
19.7%
_ 28
11.7%
27
11.3%
5 13
 
5.4%
3 12
 
5.0%
1 11
 
4.6%
8 7
 
2.9%
- 6
 
2.5%
2 6
 
2.5%
Other values (11) 35
14.6%
Hangul
ValueCountFrequency (%)
39
 
8.4%
34
 
7.4%
31
 
6.7%
30
 
6.5%
25
 
5.4%
24
 
5.2%
16
 
3.5%
14
 
3.0%
13
 
2.8%
12
 
2.6%
Other values (49) 224
48.5%

판매금액
Real number (ℝ)

Distinct47
Distinct (%)71.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28338.333
Minimum930
Maximum150000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T11:10:06.914778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum930
5-th percentile1080
Q19925
median20500
Q338225
95-th percentile75000
Maximum150000
Range149070
Interquartile range (IQR)28300

Descriptive statistics

Standard deviation30068.819
Coefficient of variation (CV)1.0610652
Kurtosis6.9337018
Mean28338.333
Median Absolute Deviation (MAD)14500
Skewness2.3549861
Sum1870330
Variance9.0413388 × 108
MonotonicityNot monotonic
2023-12-12T11:10:07.141669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
24000 4
 
6.1%
75000 3
 
4.5%
40000 3
 
4.5%
1000 3
 
4.5%
28000 2
 
3.0%
18000 2
 
3.0%
1870 2
 
3.0%
35000 2
 
3.0%
150000 2
 
3.0%
50000 2
 
3.0%
Other values (37) 41
62.1%
ValueCountFrequency (%)
930 1
 
1.5%
1000 3
4.5%
1320 1
 
1.5%
1370 1
 
1.5%
1650 1
 
1.5%
1870 2
3.0%
2310 1
 
1.5%
2750 1
 
1.5%
3300 1
 
1.5%
5610 1
 
1.5%
ValueCountFrequency (%)
150000 2
3.0%
100000 1
 
1.5%
75000 3
4.5%
60000 1
 
1.5%
50000 2
3.0%
46000 2
3.0%
44000 1
 
1.5%
42000 1
 
1.5%
40000 3
4.5%
38500 1
 
1.5%

요금적용가능요일
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Memory size660.0 B
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
33 
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일
12 
월요일, 화요일, 수요일, 목요일, 금요일
화요일, 목요일
월요일, 화요일, 수요일, 목요일, 금요일, 토요일
 
3
Other values (7)

Length

Max length38
Median length35.5
Mean length27.727273
Min length3

Unique

Unique5 ?
Unique (%)7.6%

Sample

1st row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
2nd row토요일
3rd row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
4th row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
5th row일요일

Common Values

ValueCountFrequency (%)
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일 33
50.0%
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일 12
 
18.2%
월요일, 화요일, 수요일, 목요일, 금요일 5
 
7.6%
화요일, 목요일 4
 
6.1%
월요일, 화요일, 수요일, 목요일, 금요일, 토요일 3
 
4.5%
<NA> 2
 
3.0%
월요일, 수요일, 금요일 2
 
3.0%
토요일 1
 
1.5%
일요일 1
 
1.5%
월요일 1
 
1.5%
Other values (2) 2
 
3.0%

Length

2023-12-12T11:10:07.287720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화요일 58
14.8%
목요일 57
14.5%
월요일 56
14.3%
수요일 56
14.3%
금요일 56
14.3%
토요일 49
12.5%
일요일 46
11.7%
공휴일 12
 
3.1%
na 2
 
0.5%

일일매표단체상품최소기준인원
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size660.0 B
20
57 
0
30
 
3

Length

Max length2
Median length2
Mean length1.9090909
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 57
86.4%
0 6
 
9.1%
30 3
 
4.5%

Length

2023-12-12T11:10:07.421567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:10:07.557691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 57
86.4%
0 6
 
9.1%
30 3
 
4.5%

사용여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size198.0 B
True
63 
False
 
3
ValueCountFrequency (%)
True 63
95.5%
False 3
 
4.5%
2023-12-12T11:10:07.669712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size198.0 B
True
66 
ValueCountFrequency (%)
True 66
100.0%
2023-12-12T11:10:07.765891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct59
Distinct (%)89.4%
Missing0
Missing (%)0.0%
Memory size660.0 B
Minimum2020-05-27 14:05:00
Maximum2021-03-22 09:11:00
2023-12-12T11:10:07.893408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:10:08.046540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T11:10:05.080999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:10:04.901688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:10:05.180284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:10:04.988395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:10:08.147625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부등록일시
순번1.0000.6631.0000.5330.6020.7110.0000.994
센터코드0.6631.0001.0000.5490.7290.4620.0001.000
품목명1.0001.0001.0001.0001.0001.0001.0001.000
판매금액0.5330.5491.0001.0000.7440.2170.2270.967
요금적용가능요일0.6020.7291.0000.7441.0000.6480.0000.981
일일매표단체상품최소기준인원0.7110.4621.0000.2170.6481.0000.0000.000
사용여부0.0000.0001.0000.2270.0000.0001.0001.000
등록일시0.9941.0001.0000.9670.9810.0001.0001.000
2023-12-12T11:10:08.278428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용여부일일매표단체상품최소기준인원센터코드요금적용가능요일
사용여부1.0000.0000.0000.000
일일매표단체상품최소기준인원0.0001.0000.1790.450
센터코드0.0000.1791.0000.539
요금적용가능요일0.0000.4500.5391.000
2023-12-12T11:10:08.396730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번판매금액센터코드요금적용가능요일일일매표단체상품최소기준인원사용여부
순번1.000-0.1340.4430.2880.5350.000
판매금액-0.1341.0000.4250.4760.1730.234
센터코드0.4430.4251.0000.5390.1790.000
요금적용가능요일0.2880.4760.5391.0000.4500.000
일일매표단체상품최소기준인원0.5350.1730.1790.4501.0000.000
사용여부0.0000.2340.0000.0000.0001.000

Missing values

2023-12-12T11:10:05.318858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:10:05.457264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부강좌자유이용가능여부등록일시
01두류수영장자유수영(성인)8회28000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 12:50
12대구실내빙상장정규반[피겨]-토100000토요일20YY2020-06-12 15:03
23두류수영장자유수영(청소년)8회24000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 15:46
34두류수영장자유수영(어린이/경로)8회16000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 15:46
45두류수영장자유수영(성인)_3회10500일요일20YY2020-06-12 15:47
56두류수영장자유수영(청소년)_3회9000월요일20YY2020-06-12 15:48
67두류수영장자유수영(어린이/경로)_3회6000화요일20YY2020-06-12 15:49
78두류수영장자유헬스(성인)8회24000월요일, 화요일, 수요일, 목요일, 금요일, 토요일20YY2020-06-12 17:58
89두류수영장자유헬스(청소년)8회20000월요일, 화요일, 수요일, 목요일, 금요일20YY2020-06-12 18:00
910대구실내빙상장정규스케이트대여(4회)12000<NA>20YY2020-06-13 10:28
순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부강좌자유이용가능여부등록일시
5657두류수영장자유수영(어린이/경로)5회10000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-07-31 19:23
5758서재문화체육센터코 배드민턴 (3주)18000월요일, 화요일, 수요일, 목요일, 금요일20YY2020-10-30 09:16
5859서재문화체육센터청소년_군인_여자 수영2750월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:50
5960서재문화체육센터어린이_노인_여자 수영1870월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:50
6061서재문화체육센터단체 수영_여자(어른)2310월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:52
6162서재문화체육센터단체 수영_여자(청소년_군인)1870월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:53
6263서재문화체육센터단체 수영_여자(어린이_노인)1320월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:54
6364서재문화체육센터어른 수영_여자(50%)1650월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:54
6465서재문화체육센터청소년_군인_여자 수영(50%)1370월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:55
6566서재문화체육센터어린이_노인_여자 수영(50%)930월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:55