Overview

Dataset statistics

Number of variables9
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory76.8 B

Variable types

Numeric1
Categorical6
Text2

Dataset

Description수원도시공사 내 화산체육공원에서 보유중인 강좌 및 체육시설프로그램에 관한 이용내역 구분 사용월 사용시간 대상 수강료의 항목의 데이터를 반기별로 제공합니다.
Author수원도시공사
URLhttps://www.data.go.kr/data/15060745/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
번호 is highly overall correlated with 이용내역 and 1 other fieldsHigh correlation
이용내역 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
사용시간 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
대상 is highly imbalanced (81.3%)Imbalance
비고 is highly imbalanced (56.3%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2024-01-06 12:57:11.193251
Analysis finished2024-01-06 12:57:13.723448
Duration2.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.571429
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2024-01-06T12:57:14.021243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.7
Q19.5
median19
Q327.5
95-th percentile34.3
Maximum36
Range35
Interquartile range (IQR)18

Descriptive statistics

Standard deviation10.680619
Coefficient of variation (CV)0.57511027
Kurtosis-1.2508448
Mean18.571429
Median Absolute Deviation (MAD)9
Skewness-0.020879268
Sum650
Variance114.07563
MonotonicityStrictly increasing
2024-01-06T12:57:14.512731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
1 1
 
2.9%
2 1
 
2.9%
22 1
 
2.9%
23 1
 
2.9%
24 1
 
2.9%
25 1
 
2.9%
26 1
 
2.9%
27 1
 
2.9%
28 1
 
2.9%
29 1
 
2.9%
Other values (25) 25
71.4%
ValueCountFrequency (%)
1 1
2.9%
2 1
2.9%
3 1
2.9%
4 1
2.9%
5 1
2.9%
6 1
2.9%
7 1
2.9%
8 1
2.9%
9 1
2.9%
10 1
2.9%
ValueCountFrequency (%)
36 1
2.9%
35 1
2.9%
34 1
2.9%
33 1
2.9%
32 1
2.9%
31 1
2.9%
30 1
2.9%
29 1
2.9%
28 1
2.9%
27 1
2.9%

이용내역
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
화산체육공원사업소 골프시설이용 등록
13 
화산체육공원사업소 골프아카데미 레슨
10 
화산체육공원사업소 체육시설이용 테니스장
화산체육공원사업소 체육시설이용 축구장
화산체육공원사업소 골프시설이용 Par-3

Length

Max length22
Median length19
Mean length19.628571
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화산체육공원사업소 골프시설이용 등록
2nd row화산체육공원사업소 골프시설이용 등록
3rd row화산체육공원사업소 골프시설이용 등록
4th row화산체육공원사업소 골프시설이용 등록
5th row화산체육공원사업소 골프시설이용 등록

Common Values

ValueCountFrequency (%)
화산체육공원사업소 골프시설이용 등록 13
37.1%
화산체육공원사업소 골프아카데미 레슨 10
28.6%
화산체육공원사업소 체육시설이용 테니스장 6
17.1%
화산체육공원사업소 체육시설이용 축구장 4
 
11.4%
화산체육공원사업소 골프시설이용 Par-3 2
 
5.7%

Length

2024-01-06T12:57:15.016269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:57:15.462594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화산체육공원사업소 35
33.3%
골프시설이용 15
14.3%
등록 13
 
12.4%
골프아카데미 10
 
9.5%
레슨 10
 
9.5%
체육시설이용 10
 
9.5%
테니스장 6
 
5.7%
축구장 4
 
3.8%
par-3 2
 
1.9%

구분
Text

Distinct19
Distinct (%)54.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
2024-01-06T12:57:16.245120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length4
Mean length5.7428571
Min length2

Characters and Unicode

Total characters201
Distinct characters54
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)20.0%

Sample

1st row정기회원
2nd row정기회원
3rd row정기회원
4th row정기회원
5th row정기회원
ValueCountFrequency (%)
정기회원 5
 
12.2%
일일입장 3
 
7.3%
주중권 2
 
4.9%
레슨 2
 
4.9%
기타행사(1일 2
 
4.9%
일반레슨 2
 
4.9%
쿠폰레슨 2
 
4.9%
기타행사(1시간 2
 
4.9%
개인사용 2
 
4.9%
전용사용(1일 2
 
4.9%
Other values (14) 17
41.5%
2024-01-06T12:57:17.237652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
6.5%
10
 
5.0%
10
 
5.0%
( 10
 
5.0%
) 10
 
5.0%
9
 
4.5%
9
 
4.5%
9
 
4.5%
8
 
4.0%
8
 
4.0%
Other values (44) 105
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152
75.6%
Decimal Number 15
 
7.5%
Open Punctuation 10
 
5.0%
Close Punctuation 10
 
5.0%
Space Separator 6
 
3.0%
Lowercase Letter 4
 
2.0%
Uppercase Letter 2
 
1.0%
Dash Punctuation 1
 
0.5%
Math Symbol 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
8.6%
10
 
6.6%
10
 
6.6%
9
 
5.9%
9
 
5.9%
9
 
5.9%
8
 
5.3%
8
 
5.3%
5
 
3.3%
4
 
2.6%
Other values (31) 67
44.1%
Decimal Number
ValueCountFrequency (%)
1 8
53.3%
9 3
 
20.0%
3 2
 
13.3%
2 1
 
6.7%
5 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
r 2
50.0%
a 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Uppercase Letter
ValueCountFrequency (%)
P 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 152
75.6%
Common 43
 
21.4%
Latin 6
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
8.6%
10
 
6.6%
10
 
6.6%
9
 
5.9%
9
 
5.9%
9
 
5.9%
8
 
5.3%
8
 
5.3%
5
 
3.3%
4
 
2.6%
Other values (31) 67
44.1%
Common
ValueCountFrequency (%)
( 10
23.3%
) 10
23.3%
1 8
18.6%
6
14.0%
9 3
 
7.0%
3 2
 
4.7%
- 1
 
2.3%
2 1
 
2.3%
+ 1
 
2.3%
5 1
 
2.3%
Latin
ValueCountFrequency (%)
r 2
33.3%
a 2
33.3%
P 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152
75.6%
ASCII 49
 
24.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
8.6%
10
 
6.6%
10
 
6.6%
9
 
5.9%
9
 
5.9%
9
 
5.9%
8
 
5.3%
8
 
5.3%
5
 
3.3%
4
 
2.6%
Other values (31) 67
44.1%
ASCII
ValueCountFrequency (%)
( 10
20.4%
) 10
20.4%
1 8
16.3%
6
12.2%
9 3
 
6.1%
r 2
 
4.1%
a 2
 
4.1%
P 2
 
4.1%
3 2
 
4.1%
- 1
 
2.0%
Other values (3) 3
 
6.1%
Distinct15
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
1개월
평일
토/공휴일
일일권
1회
Other values (10)
11 

Length

Max length9
Median length5
Mean length3.2285714
Min length1

Unique

Unique9 ?
Unique (%)25.7%

Sample

1st row1개월
2nd row3개월
3rd row6개월
4th row12개월
5th row부부회원

Common Values

ValueCountFrequency (%)
1개월 7
20.0%
평일 6
17.1%
토/공휴일 5
14.3%
일일권 3
8.6%
1회 3
8.6%
3개월 2
 
5.7%
6개월 1
 
2.9%
12개월 1
 
2.9%
부부회원 1
 
2.9%
20매 1
 
2.9%
Other values (5) 5
14.3%

Length

2024-01-06T12:57:17.665652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1개월 7
19.4%
평일 6
16.7%
토/공휴일 5
13.9%
1회 4
11.1%
일일권 3
8.3%
3개월 2
 
5.6%
6개월 1
 
2.8%
12개월 1
 
2.8%
부부회원 1
 
2.8%
20매 1
 
2.8%
Other values (5) 5
13.9%

사용시간
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
전용사용(1면)
16 
20분
1시간
3시간
40분
Other values (5)

Length

Max length8
Median length3
Mean length5.2857143
Min length3

Unique

Unique4 ?
Unique (%)11.4%

Sample

1st row전용사용(1면)
2nd row전용사용(1면)
3rd row전용사용(1면)
4th row전용사용(1면)
5th row전용사용(1면)

Common Values

ValueCountFrequency (%)
전용사용(1면) 16
45.7%
20분 5
 
14.3%
1시간 4
 
11.4%
3시간 2
 
5.7%
40분 2
 
5.7%
6시간 2
 
5.7%
30분 1
 
2.9%
60분 1
 
2.9%
90분 1
 
2.9%
70분 1
 
2.9%

Length

2024-01-06T12:57:18.135999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:57:18.496248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전용사용(1면 16
45.7%
20분 5
 
14.3%
1시간 4
 
11.4%
3시간 2
 
5.7%
40분 2
 
5.7%
6시간 2
 
5.7%
30분 1
 
2.9%
60분 1
 
2.9%
90분 1
 
2.9%
70분 1
 
2.9%

대상
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
일반
34 
일반
 
1

Length

Max length3
Median length2
Mean length2.0285714
Min length2

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 34
97.1%
일반 1
 
2.9%

Length

2024-01-06T12:57:18.953420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:57:19.356240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 35
100.0%
Distinct32
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size412.0 B
2024-01-06T12:57:19.719890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length7
Mean length5.7714286
Min length4

Characters and Unicode

Total characters202
Distinct characters11
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)85.7%

Sample

1st row130000
2nd row360000
3rd row660000
4th row1200000
5th row2200000
ValueCountFrequency (%)
100000 3
 
8.6%
300000 2
 
5.7%
130000 1
 
2.9%
18000 1
 
2.9%
500000 1
 
2.9%
200000 1
 
2.9%
250000 1
 
2.9%
450000 1
 
2.9%
1인200000/2인300000 1
 
2.9%
72500 1
 
2.9%
Other values (22) 22
62.9%
2024-01-06T12:57:20.733793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 140
69.3%
2 15
 
7.4%
1 12
 
5.9%
5 11
 
5.4%
3 7
 
3.5%
6 5
 
2.5%
4 4
 
2.0%
7 3
 
1.5%
8 2
 
1.0%
2
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 199
98.5%
Other Letter 2
 
1.0%
Other Punctuation 1
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 140
70.4%
2 15
 
7.5%
1 12
 
6.0%
5 11
 
5.5%
3 7
 
3.5%
6 5
 
2.5%
4 4
 
2.0%
7 3
 
1.5%
8 2
 
1.0%
Other Letter
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 200
99.0%
Hangul 2
 
1.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 140
70.0%
2 15
 
7.5%
1 12
 
6.0%
5 11
 
5.5%
3 7
 
3.5%
6 5
 
2.5%
4 4
 
2.0%
7 3
 
1.5%
8 2
 
1.0%
/ 1
 
0.5%
Hangul
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 200
99.0%
Hangul 2
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 140
70.0%
2 15
 
7.5%
1 12
 
6.0%
5 11
 
5.5%
3 7
 
3.5%
6 5
 
2.5%
4 4
 
2.0%
7 3
 
1.5%
8 2
 
1.0%
/ 1
 
0.5%
Hangul
ValueCountFrequency (%)
2
100.0%

비고
Categorical

IMBALANCE 

Distinct7
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
없음
28 
월 12회
 
2
주말 공휴일 사용불가 평일 18시이전 이용가능
 
1
<NA>
 
1
주 1회
 
1
Other values (2)
 
2

Length

Max length35
Median length2
Mean length3.9428571
Min length2

Unique

Unique5 ?
Unique (%)14.3%

Sample

1st row없음
2nd row없음
3rd row없음
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 28
80.0%
월 12회 2
 
5.7%
주말 공휴일 사용불가 평일 18시이전 이용가능 1
 
2.9%
<NA> 1
 
2.9%
주 1회 1
 
2.9%
월 8회 1
 
2.9%
1인 200000원 / 2인 300000원 / 1팀당500000 1
 
2.9%

Length

2024-01-06T12:57:21.133476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:57:21.459578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 28
56.0%
3
 
6.0%
12회 2
 
4.0%
2
 
4.0%
1회 1
 
2.0%
300000원 1
 
2.0%
2인 1
 
2.0%
200000원 1
 
2.0%
1인 1
 
2.0%
8회 1
 
2.0%
Other values (9) 9
 
18.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-11-30
35 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-11-30
2nd row2023-11-30
3rd row2023-11-30
4th row2023-11-30
5th row2023-11-30

Common Values

ValueCountFrequency (%)
2023-11-30 35
100.0%

Length

2024-01-06T12:57:22.024905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-06T12:57:22.392699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-11-30 35
100.0%

Interactions

2024-01-06T12:57:12.398231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-06T12:57:22.597440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호이용내역구분사용월_사용요일사용시간대상수강료(원)비고
번호1.0000.9630.9390.7550.8670.0000.9160.000
이용내역0.9631.0001.0000.8370.9380.0000.9670.000
구분0.9391.0001.0000.2890.9221.0000.0000.891
사용월_사용요일0.7550.8370.2891.0000.0000.0000.9490.000
사용시간0.8670.9380.9220.0001.0000.0000.0000.549
대상0.0000.0001.0000.0000.0001.0001.0000.000
수강료(원)0.9160.9670.0000.9490.0001.0001.0000.000
비고0.0000.0000.8910.0000.5490.0000.0001.000
2024-01-06T12:57:22.930713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용시간사용월_사용요일대상비고이용내역
사용시간1.0000.0000.0000.2890.602
사용월_사용요일0.0001.0000.0000.0000.412
대상0.0000.0001.0000.0000.000
비고0.2890.0000.0001.0000.000
이용내역0.6020.4120.0000.0001.000
2024-01-06T12:57:23.209680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호이용내역사용월_사용요일사용시간대상비고
번호1.0000.7720.3150.5080.2250.000
이용내역0.7721.0000.4120.6020.0000.000
사용월_사용요일0.3150.4121.0000.0000.0000.000
사용시간0.5080.6020.0001.0000.0000.289
대상0.2250.0000.0000.0001.0000.000
비고0.0000.0000.0000.2890.0001.000

Missing values

2024-01-06T12:57:12.836429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-06T12:57:13.426571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호이용내역구분사용월_사용요일사용시간대상수강료(원)비고데이터기준일자
01화산체육공원사업소 골프시설이용 등록정기회원1개월전용사용(1면)일반130000없음2023-11-30
12화산체육공원사업소 골프시설이용 등록정기회원3개월전용사용(1면)일반360000없음2023-11-30
23화산체육공원사업소 골프시설이용 등록정기회원6개월전용사용(1면)일반660000없음2023-11-30
34화산체육공원사업소 골프시설이용 등록정기회원12개월전용사용(1면)일반1200000없음2023-11-30
45화산체육공원사업소 골프시설이용 등록정기회원부부회원전용사용(1면)일반2200000없음2023-11-30
56화산체육공원사업소 골프시설이용 등록주중권1개월전용사용(1면)일반100000주말 공휴일 사용불가 평일 18시이전 이용가능2023-11-30
67화산체육공원사업소 골프시설이용 등록주중권3개월전용사용(1면)일반270000없음2023-11-30
78화산체육공원사업소 골프시설이용 등록쿠폰회원20매전용사용(1면)일반180000없음2023-11-30
89화산체육공원사업소 골프시설이용 등록쿠폰회원40매전용사용(1면)일반320000없음2023-11-30
910화산체육공원사업소 골프시설이용 등록일일입장일일권30분일반6000없음2023-11-30
번호이용내역구분사용월_사용요일사용시간대상수강료(원)비고데이터기준일자
2527화산체육공원사업소 골프아카데미 레슨그룹레슨(5인이상)1개월40분일반100000없음2023-11-30
2628화산체육공원사업소 골프아카데미 레슨쿠폰레슨일 1회6시간일반1인200000/2인300000<NA>2023-11-30
2729화산체육공원사업소 골프아카데미 레슨쿠폰레슨10회20분일반300000주 1회2023-11-30
2830화산체육공원사업소 골프아카데미 레슨일반레슨1개월20분일반300000월 12회2023-11-30
2931화산체육공원사업소 골프아카데미 레슨일반 + Par3(2회)1개월20분일반450000없음2023-11-30
3032화산체육공원사업소 골프아카데미 레슨일반레슨1개월20분일반250000월 8회2023-11-30
3133화산체육공원사업소 골프아카데미 레슨필드레슨1회6시간일반2000001인 200000원 / 2인 300000원 / 1팀당5000002023-11-30
3234화산체육공원사업소 골프아카데미 레슨특별레슨1개월40분일반500000월 12회2023-11-30
3335화산체육공원사업소 골프아카데미 레슨원포인트 레슨1회20분일반50000없음2023-11-30
3436화산체육공원사업소 골프아카데미 레슨Par-3 및 9홀 레슨1회70분일반100000없음2023-11-30