Overview

Dataset statistics

Number of variables9
Number of observations68
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory76.9 B

Variable types

Numeric2
Categorical3
Text1
Boolean2
DateTime1

Dataset

Description대구시설공단 체육시설 매출정보입니다.
Author대구시설공단
URLhttps://www.data.go.kr/data/15088096/fileData.do

Alerts

강좌자유이용가능여부 has constant value ""Constant
센터코드 is highly overall correlated with 요금적용가능요일High correlation
요금적용가능요일 is highly overall correlated with 센터코드High correlation
일일매표단체상품최소기준인원 is highly imbalanced (51.1%)Imbalance
사용여부 is highly imbalanced (73.9%)Imbalance
순번 has unique valuesUnique
품목명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:26:04.663543
Analysis finished2023-12-12 12:26:05.750306
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.5
Minimum1
Maximum68
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-12T21:26:05.848913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.35
Q117.75
median34.5
Q351.25
95-th percentile64.65
Maximum68
Range67
Interquartile range (IQR)33.5

Descriptive statistics

Standard deviation19.77372
Coefficient of variation (CV)0.5731513
Kurtosis-1.2
Mean34.5
Median Absolute Deviation (MAD)17
Skewness0
Sum2346
Variance391
MonotonicityStrictly increasing
2023-12-12T21:26:06.019125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
45 1
 
1.5%
51 1
 
1.5%
50 1
 
1.5%
49 1
 
1.5%
48 1
 
1.5%
47 1
 
1.5%
46 1
 
1.5%
44 1
 
1.5%
36 1
 
1.5%
Other values (58) 58
85.3%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
68 1
1.5%
67 1
1.5%
66 1
1.5%
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%

센터코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
서재문화체육센터
37 
두류수영장
23 
대구실내빙상장

Length

Max length8
Median length8
Mean length6.8676471
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row두류수영장
2nd row대구실내빙상장
3rd row두류수영장
4th row두류수영장
5th row두류수영장

Common Values

ValueCountFrequency (%)
서재문화체육센터 37
54.4%
두류수영장 23
33.8%
대구실내빙상장 8
 
11.8%

Length

2023-12-12T21:26:06.185427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:26:06.303104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서재문화체육센터 37
54.4%
두류수영장 23
33.8%
대구실내빙상장 8
 
11.8%

품목명
Text

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-12T21:26:06.568937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length10.602941
Min length2

Characters and Unicode

Total characters721
Distinct characters81
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row자유수영(성인)8회
2nd row정규반[피겨]-토
3rd row자유수영(청소년)8회
4th row자유수영(어린이/경로)8회
5th row자유수영(성인)_3회
ValueCountFrequency (%)
경노 4
 
4.2%
성인 4
 
4.2%
배드민턴 3
 
3.2%
단체 3
 
3.2%
자유수영(5회 3
 
3.2%
수영(50 2
 
2.1%
주2회 2
 
2.1%
청소년 2
 
2.1%
주3회 2
 
2.1%
레슨 2
 
2.1%
Other values (60) 68
71.6%
2023-12-12T21:26:07.080432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 49
 
6.8%
) 49
 
6.8%
39
 
5.4%
34
 
4.7%
31
 
4.3%
30
 
4.2%
_ 28
 
3.9%
27
 
3.7%
25
 
3.5%
25
 
3.5%
Other values (71) 384
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 477
66.2%
Decimal Number 59
 
8.2%
Open Punctuation 55
 
7.6%
Close Punctuation 55
 
7.6%
Connector Punctuation 28
 
3.9%
Space Separator 27
 
3.7%
Other Punctuation 11
 
1.5%
Dash Punctuation 6
 
0.8%
Math Symbol 2
 
0.3%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
8.2%
34
 
7.1%
31
 
6.5%
30
 
6.3%
25
 
5.2%
25
 
5.2%
17
 
3.6%
14
 
2.9%
13
 
2.7%
13
 
2.7%
Other values (49) 236
49.5%
Decimal Number
ValueCountFrequency (%)
5 13
22.0%
3 12
20.3%
1 11
18.6%
8 7
11.9%
2 6
10.2%
6 5
 
8.5%
0 3
 
5.1%
4 1
 
1.7%
9 1
 
1.7%
Other Punctuation
ValueCountFrequency (%)
/ 4
36.4%
. 3
27.3%
% 3
27.3%
, 1
 
9.1%
Open Punctuation
ValueCountFrequency (%)
( 49
89.1%
[ 6
 
10.9%
Close Punctuation
ValueCountFrequency (%)
) 49
89.1%
] 6
 
10.9%
Connector Punctuation
ValueCountFrequency (%)
_ 28
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
P 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 477
66.2%
Common 243
33.7%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
8.2%
34
 
7.1%
31
 
6.5%
30
 
6.3%
25
 
5.2%
25
 
5.2%
17
 
3.6%
14
 
2.9%
13
 
2.7%
13
 
2.7%
Other values (49) 236
49.5%
Common
ValueCountFrequency (%)
( 49
20.2%
) 49
20.2%
_ 28
11.5%
27
11.1%
5 13
 
5.3%
3 12
 
4.9%
1 11
 
4.5%
8 7
 
2.9%
- 6
 
2.5%
] 6
 
2.5%
Other values (11) 35
14.4%
Latin
ValueCountFrequency (%)
P 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 477
66.2%
ASCII 244
33.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 49
20.1%
) 49
20.1%
_ 28
11.5%
27
11.1%
5 13
 
5.3%
3 12
 
4.9%
1 11
 
4.5%
8 7
 
2.9%
- 6
 
2.5%
] 6
 
2.5%
Other values (12) 36
14.8%
Hangul
ValueCountFrequency (%)
39
 
8.2%
34
 
7.1%
31
 
6.5%
30
 
6.3%
25
 
5.2%
25
 
5.2%
17
 
3.6%
14
 
2.9%
13
 
2.7%
13
 
2.7%
Other values (49) 236
49.5%

판매금액
Real number (ℝ)

Distinct49
Distinct (%)72.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27571.029
Minimum930
Maximum150000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-12T21:26:07.300503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum930
5-th percentile1112
Q19262.5
median19900
Q337675
95-th percentile75000
Maximum150000
Range149070
Interquartile range (IQR)28412.5

Descriptive statistics

Standard deviation29947.714
Coefficient of variation (CV)1.0862022
Kurtosis7.0413326
Mean27571.029
Median Absolute Deviation (MAD)14695
Skewness2.3692678
Sum1874830
Variance8.9686558 × 108
MonotonicityNot monotonic
2023-12-12T21:26:07.517854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
24000 4
 
5.9%
40000 3
 
4.4%
1000 3
 
4.4%
75000 3
 
4.4%
1870 2
 
2.9%
46000 2
 
2.9%
32000 2
 
2.9%
50000 2
 
2.9%
150000 2
 
2.9%
18000 2
 
2.9%
Other values (39) 43
63.2%
ValueCountFrequency (%)
930 1
 
1.5%
1000 3
4.4%
1320 1
 
1.5%
1370 1
 
1.5%
1650 1
 
1.5%
1870 2
2.9%
2000 1
 
1.5%
2310 1
 
1.5%
2500 1
 
1.5%
2750 1
 
1.5%
ValueCountFrequency (%)
150000 2
2.9%
100000 1
 
1.5%
75000 3
4.4%
60000 1
 
1.5%
50000 2
2.9%
46000 2
2.9%
44000 1
 
1.5%
42000 1
 
1.5%
40000 3
4.4%
38500 1
 
1.5%

요금적용가능요일
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Memory size676.0 B
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
33 
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일
12 
월요일, 화요일, 수요일, 목요일, 금요일
-
화요일, 목요일
Other values (7)
10 

Length

Max length38
Median length33
Mean length26.852941
Min length1

Unique

Unique5 ?
Unique (%)7.4%

Sample

1st row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
2nd row토요일
3rd row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
4th row월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일
5th row일요일

Common Values

ValueCountFrequency (%)
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일 33
48.5%
월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일 12
 
17.6%
월요일, 화요일, 수요일, 목요일, 금요일 5
 
7.4%
- 4
 
5.9%
화요일, 목요일 4
 
5.9%
월요일, 화요일, 수요일, 목요일, 금요일, 토요일 3
 
4.4%
월요일, 수요일, 금요일 2
 
2.9%
토요일 1
 
1.5%
일요일 1
 
1.5%
월요일 1
 
1.5%
Other values (2) 2
 
2.9%

Length

2023-12-12T21:26:07.724983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화요일 58
14.7%
목요일 57
14.5%
월요일 56
14.2%
수요일 56
14.2%
금요일 56
14.2%
토요일 49
12.4%
일요일 46
11.7%
공휴일 12
 
3.0%
4
 
1.0%
Distinct3
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
20
57 
0
30
 
3

Length

Max length2
Median length2
Mean length1.8823529
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 57
83.8%
0 8
 
11.8%
30 3
 
4.4%

Length

2023-12-12T21:26:08.266354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:26:08.406560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 57
83.8%
0 8
 
11.8%
30 3
 
4.4%

사용여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size200.0 B
True
65 
False
 
3
ValueCountFrequency (%)
True 65
95.6%
False 3
 
4.4%
2023-12-12T21:26:08.538150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size200.0 B
True
68 
ValueCountFrequency (%)
True 68
100.0%
2023-12-12T21:26:08.647202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct61
Distinct (%)89.7%
Missing0
Missing (%)0.0%
Memory size676.0 B
Minimum2020-05-27 14:05:00
Maximum2021-03-22 09:11:00
2023-12-12T21:26:08.789047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:26:08.997960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T21:26:05.298127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:26:05.091737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:26:05.395802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:26:05.193161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:26:09.141767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부등록일시
순번1.0000.5441.0000.5720.5090.6660.0000.994
센터코드0.5441.0001.0000.5420.8280.2620.0001.000
품목명1.0001.0001.0001.0001.0001.0001.0001.000
판매금액0.5720.5421.0001.0000.7550.2990.2350.966
요금적용가능요일0.5090.8281.0000.7551.0000.7500.0000.984
일일매표단체상품최소기준인원0.6660.2621.0000.2990.7501.0000.0000.316
사용여부0.0000.0001.0000.2350.0000.0001.0001.000
등록일시0.9941.0001.0000.9660.9840.3161.0001.000
2023-12-12T21:26:09.306584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요금적용가능요일사용여부일일매표단체상품최소기준인원센터코드
요금적용가능요일1.0000.0000.4330.515
사용여부0.0001.0000.0000.000
일일매표단체상품최소기준인원0.4330.0001.0000.083
센터코드0.5150.0000.0831.000
2023-12-12T21:26:09.444140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번판매금액센터코드요금적용가능요일일일매표단체상품최소기준인원사용여부
순번1.000-0.1980.3640.2320.4870.000
판매금액-0.1981.0000.4180.4800.2230.243
센터코드0.3640.4181.0000.5150.0830.000
요금적용가능요일0.2320.4800.5151.0000.4330.000
일일매표단체상품최소기준인원0.4870.2230.0830.4331.0000.000
사용여부0.0000.2430.0000.0000.0001.000

Missing values

2023-12-12T21:26:05.521691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:26:05.678149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부강좌자유이용가능여부등록일시
01두류수영장자유수영(성인)8회28000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 12:50
12대구실내빙상장정규반[피겨]-토100000토요일20YY2020-06-12 15:03
23두류수영장자유수영(청소년)8회24000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 15:46
34두류수영장자유수영(어린이/경로)8회16000월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일20YY2020-06-12 15:46
45두류수영장자유수영(성인)_3회10500일요일20YY2020-06-12 15:47
56두류수영장자유수영(청소년)_3회9000월요일20YY2020-06-12 15:48
67두류수영장자유수영(어린이/경로)_3회6000화요일20YY2020-06-12 15:49
78두류수영장자유헬스(성인)8회24000월요일, 화요일, 수요일, 목요일, 금요일, 토요일20YY2020-06-12 17:58
89두류수영장자유헬스(청소년)8회20000월요일, 화요일, 수요일, 목요일, 금요일20YY2020-06-12 18:00
910대구실내빙상장정규스케이트대여(4회)12000-20YY2020-06-13 10:28
순번센터코드품목명판매금액요금적용가능요일일일매표단체상품최소기준인원사용여부강좌자유이용가능여부등록일시
5859서재문화체육센터청소년_군인_여자 수영2750월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:50
5960서재문화체육센터어린이_노인_여자 수영1870월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:50
6061서재문화체육센터단체 수영_여자(어른)2310월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:52
6162서재문화체육센터단체 수영_여자(청소년_군인)1870월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:53
6263서재문화체육센터단체 수영_여자(어린이_노인)1320월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일30YY2021-01-07 14:54
6364서재문화체육센터어른 수영_여자(50%)1650월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:54
6465서재문화체육센터청소년_군인_여자 수영(50%)1370월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:55
6566서재문화체육센터어린이_노인_여자 수영(50%)930월요일, 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일0YY2021-01-07 14:55
6667두류수영장배드민턴(성인)2500-0YY2021-01-21 14:14
6768두류수영장배드민턴(청소년,경로)2000-0YY2021-01-21 14:16