Overview

Dataset statistics

Number of variables12
Number of observations2958
Missing cells2
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory289.0 KiB
Average record size in memory100.0 B

Variable types

Numeric4
Categorical5
Text3

Dataset

Description올림픽공원 체조경기장 올림픽스포츠센터 강좌정보입니다.현재 수강가능한 강좌정보를 제공해드립니다.강좌일련번호, 강좌명, 수강월, 수강기간, 정원, 수강료, 기간 시작일, 종료일 등 정보 제공
Author한국체육산업개발주식회사
URLhttps://www.data.go.kr/data/15110953/fileData.do

Alerts

시작월 is highly overall correlated with 정원 and 3 other fieldsHigh correlation
시작일자 is highly overall correlated with 정원 and 3 other fieldsHigh correlation
일련번호 is highly overall correlated with 강습반High correlation
정원 is highly overall correlated with 금액 and 4 other fieldsHigh correlation
금액 is highly overall correlated with 정원 and 1 other fieldsHigh correlation
강좌코드명 is highly overall correlated with 정원 and 4 other fieldsHigh correlation
요일 is highly overall correlated with 강좌코드명 and 1 other fieldsHigh correlation
강습반 is highly overall correlated with 일련번호 and 6 other fieldsHigh correlation
강좌코드명 is highly imbalanced (77.9%)Imbalance
요일 is highly imbalanced (50.1%)Imbalance
강습반 is highly imbalanced (79.1%)Imbalance
정원 is highly skewed (γ1 = 27.28193177)Skewed

Reproduction

Analysis started2023-12-12 22:19:04.534629
Analysis finished2023-12-12 22:19:07.650766
Duration3.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct1413
Distinct (%)47.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean666255.44
Minimum4005
Maximum1017468
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.1 KiB
2023-12-13T07:19:07.745868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4005
5-th percentile4126
Q121724
median1001371
Q31005669.8
95-th percentile1012298
Maximum1017468
Range1013463
Interquartile range (IQR)983945.75

Descriptive statistics

Standard deviation470566.54
Coefficient of variation (CV)0.70628547
Kurtosis-1.5502524
Mean666255.44
Median Absolute Deviation (MAD)6072.5
Skewness-0.67084974
Sum1.9707836 × 109
Variance2.2143287 × 1011
MonotonicityNot monotonic
2023-12-13T07:19:07.925085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1005618 4
 
0.1%
4103 4
 
0.1%
4261 4
 
0.1%
4260 4
 
0.1%
4141 4
 
0.1%
4259 4
 
0.1%
4258 4
 
0.1%
4257 4
 
0.1%
4256 4
 
0.1%
4254 4
 
0.1%
Other values (1403) 2918
98.6%
ValueCountFrequency (%)
4005 1
< 0.1%
4016 1
< 0.1%
4024 2
0.1%
4029 2
0.1%
4030 2
0.1%
4033 2
0.1%
4034 2
0.1%
4036 2
0.1%
4038 2
0.1%
4040 2
0.1%
ValueCountFrequency (%)
1017468 1
< 0.1%
1017034 1
< 0.1%
1017033 1
< 0.1%
1017032 1
< 0.1%
1017031 1
< 0.1%
1017030 1
< 0.1%
1017029 1
< 0.1%
1017028 1
< 0.1%
1016859 1
< 0.1%
1016858 1
< 0.1%

강좌번호
Real number (ℝ)

Distinct72
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.613252
Minimum0
Maximum151
Zeros12
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size26.1 KiB
2023-12-13T07:19:08.058599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile7
Q115
median22
Q327
95-th percentile38
Maximum151
Range151
Interquartile range (IQR)12

Descriptive statistics

Standard deviation10.626336
Coefficient of variation (CV)0.49165837
Kurtosis26.855297
Mean21.613252
Median Absolute Deviation (MAD)6
Skewness2.89669
Sum63932
Variance112.91903
MonotonicityNot monotonic
2023-12-13T07:19:08.547553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
22 199
 
6.7%
23 162
 
5.5%
27 147
 
5.0%
21 138
 
4.7%
25 131
 
4.4%
26 131
 
4.4%
20 127
 
4.3%
15 114
 
3.9%
28 113
 
3.8%
13 110
 
3.7%
Other values (62) 1586
53.6%
ValueCountFrequency (%)
0 12
 
0.4%
1 6
 
0.2%
2 19
 
0.6%
3 19
 
0.6%
4 27
0.9%
5 24
0.8%
6 33
1.1%
7 16
 
0.5%
8 40
1.4%
9 53
1.8%
ValueCountFrequency (%)
151 1
< 0.1%
147 1
< 0.1%
137 1
< 0.1%
124 1
< 0.1%
120 1
< 0.1%
117 1
< 0.1%
80 1
< 0.1%
78 1
< 0.1%
76 1
< 0.1%
75 1
< 0.1%

시작월
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
2022-08-01
1077 
2020-01-01
333 
2022-07-01
266 
2022-02-01
257 
2023-01-01
234 
Other values (37)
791 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique9 ?
Unique (%)0.3%

Sample

1st row2023-01-01
2nd row2023-01-01
3rd row2023-01-01
4th row2023-01-01
5th row2023-01-01

Common Values

ValueCountFrequency (%)
2022-08-01 1077
36.4%
2020-01-01 333
 
11.3%
2022-07-01 266
 
9.0%
2022-02-01 257
 
8.7%
2023-01-01 234
 
7.9%
2023-07-01 167
 
5.6%
2020-08-01 100
 
3.4%
2022-11-01 80
 
2.7%
2021-05-01 67
 
2.3%
2023-03-01 67
 
2.3%
Other values (32) 310
 
10.5%

Length

2023-12-13T07:19:08.681418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-08-01 1077
36.4%
2020-01-01 333
 
11.3%
2022-07-01 266
 
9.0%
2022-02-01 257
 
8.7%
2023-01-01 234
 
7.9%
2023-07-01 167
 
5.6%
2020-08-01 100
 
3.4%
2022-11-01 80
 
2.7%
2021-05-01 67
 
2.3%
2023-03-01 67
 
2.3%
Other values (32) 310
 
10.5%

강좌코드명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct38
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
테니스
2535 
골프
 
94
당구
 
70
실내승마교실
 
37
탁구
 
36
Other values (33)
 
186

Length

Max length9
Median length3
Mean length3.0831643
Min length2

Unique

Unique7 ?
Unique (%)0.2%

Sample

1st row검도
2nd row검도
3rd row검도
4th row검도
5th row검도

Common Values

ValueCountFrequency (%)
테니스 2535
85.7%
골프 94
 
3.2%
당구 70
 
2.4%
실내승마교실 37
 
1.3%
탁구 36
 
1.2%
댄스스포츠 22
 
0.7%
실내축구 18
 
0.6%
배드민턴 16
 
0.5%
로드 사이클 14
 
0.5%
배드민턴 단기속성 14
 
0.5%
Other values (28) 102
 
3.4%

Length

2023-12-13T07:19:08.817373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
테니스 2535
84.8%
골프 94
 
3.1%
당구 70
 
2.3%
실내승마교실 37
 
1.2%
탁구 36
 
1.2%
배드민턴 30
 
1.0%
댄스스포츠 22
 
0.7%
실내축구 18
 
0.6%
탁구단기속성 14
 
0.5%
단기속성 14
 
0.5%
Other values (30) 118
 
3.9%
Distinct409
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
2023-12-13T07:19:09.138849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length13.075727
Min length4

Characters and Unicode

Total characters38678
Distinct characters198
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique247 ?
Unique (%)8.4%

Sample

1st row검도 1800-1930 (월화목금) 청소년
2nd row검도 1800-1930 (월화목금) 성인
3rd row검도 1930-2100 (월화목금) 성인
4th row검도 0600-0730 (월화목금) 성인
5th row검도 0600-0730 (월화목금) 청소년
ValueCountFrequency (%)
테니스(평일 675
 
10.2%
테니스(주말복수 367
 
5.5%
테니스(주말 361
 
5.4%
실외 318
 
4.8%
실외테니스(주말 298
 
4.5%
실외테니스(주말복수 270
 
4.1%
최태훈 202
 
3.0%
전동호 200
 
3.0%
최성하 185
 
2.8%
이동헌 182
 
2.7%
Other values (281) 3583
54.0%
2023-12-13T07:19:09.643445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3855
 
10.0%
( 2438
 
6.3%
) 2438
 
6.3%
2411
 
6.2%
2386
 
6.2%
2381
 
6.2%
1476
 
3.8%
1345
 
3.5%
1339
 
3.5%
1305
 
3.4%
Other values (188) 17304
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27131
70.1%
Space Separator 3855
 
10.0%
Open Punctuation 2438
 
6.3%
Close Punctuation 2438
 
6.3%
Decimal Number 2046
 
5.3%
Other Punctuation 449
 
1.2%
Dash Punctuation 164
 
0.4%
Math Symbol 134
 
0.3%
Uppercase Letter 12
 
< 0.1%
Lowercase Letter 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2411
 
8.9%
2386
 
8.8%
2381
 
8.8%
1476
 
5.4%
1345
 
5.0%
1339
 
4.9%
1305
 
4.8%
1016
 
3.7%
749
 
2.8%
702
 
2.6%
Other values (159) 12021
44.3%
Decimal Number
ValueCountFrequency (%)
0 657
32.1%
1 592
28.9%
3 161
 
7.9%
2 145
 
7.1%
6 117
 
5.7%
8 87
 
4.3%
5 83
 
4.1%
9 82
 
4.0%
4 75
 
3.7%
7 47
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
P 4
33.3%
T 2
16.7%
E 2
16.7%
N 2
16.7%
S 2
16.7%
Other Punctuation
ValueCountFrequency (%)
/ 250
55.7%
: 153
34.1%
. 32
 
7.1%
, 14
 
3.1%
Lowercase Letter
ValueCountFrequency (%)
o 8
80.0%
e 1
 
10.0%
x 1
 
10.0%
Math Symbol
ValueCountFrequency (%)
~ 128
95.5%
+ 6
 
4.5%
Space Separator
ValueCountFrequency (%)
3855
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2438
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2438
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 164
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27131
70.1%
Common 11525
29.8%
Latin 22
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2411
 
8.9%
2386
 
8.8%
2381
 
8.8%
1476
 
5.4%
1345
 
5.0%
1339
 
4.9%
1305
 
4.8%
1016
 
3.7%
749
 
2.8%
702
 
2.6%
Other values (159) 12021
44.3%
Common
ValueCountFrequency (%)
3855
33.4%
( 2438
21.2%
) 2438
21.2%
0 657
 
5.7%
1 592
 
5.1%
/ 250
 
2.2%
- 164
 
1.4%
3 161
 
1.4%
: 153
 
1.3%
2 145
 
1.3%
Other values (11) 672
 
5.8%
Latin
ValueCountFrequency (%)
o 8
36.4%
P 4
18.2%
T 2
 
9.1%
E 2
 
9.1%
N 2
 
9.1%
S 2
 
9.1%
e 1
 
4.5%
x 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27131
70.1%
ASCII 11547
29.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3855
33.4%
( 2438
21.1%
) 2438
21.1%
0 657
 
5.7%
1 592
 
5.1%
/ 250
 
2.2%
- 164
 
1.4%
3 161
 
1.4%
: 153
 
1.3%
2 145
 
1.3%
Other values (19) 694
 
6.0%
Hangul
ValueCountFrequency (%)
2411
 
8.9%
2386
 
8.8%
2381
 
8.8%
1476
 
5.4%
1345
 
5.0%
1339
 
4.9%
1305
 
4.8%
1016
 
3.7%
749
 
2.8%
702
 
2.6%
Other values (159) 12021
44.3%
Distinct146
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
2023-12-13T07:19:09.958308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters32538
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.3%

Sample

1st row18:00~19:30
2nd row18:00~19:30
3rd row19:30~21:00
4th row06:00~07:30
5th row06:00~07:30
ValueCountFrequency (%)
06:30~07:00 58
 
2.0%
10:00~10:30 58
 
2.0%
07:30~08:00 57
 
1.9%
09:00~09:30 56
 
1.9%
11:30~12:00 56
 
1.9%
10:30~11:00 56
 
1.9%
06:00~06:30 56
 
1.9%
08:00~08:30 56
 
1.9%
11:00~11:30 56
 
1.9%
08:30~09:00 54
 
1.8%
Other values (136) 2395
81.0%
2023-12-13T07:19:10.392010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11164
34.3%
: 5916
18.2%
1 4276
 
13.1%
~ 2958
 
9.1%
3 1813
 
5.6%
2 1623
 
5.0%
4 1101
 
3.4%
8 834
 
2.6%
7 790
 
2.4%
9 768
 
2.4%
Other values (2) 1295
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 23664
72.7%
Other Punctuation 5916
 
18.2%
Math Symbol 2958
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11164
47.2%
1 4276
 
18.1%
3 1813
 
7.7%
2 1623
 
6.9%
4 1101
 
4.7%
8 834
 
3.5%
7 790
 
3.3%
9 768
 
3.2%
6 762
 
3.2%
5 533
 
2.3%
Other Punctuation
ValueCountFrequency (%)
: 5916
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2958
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32538
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11164
34.3%
: 5916
18.2%
1 4276
 
13.1%
~ 2958
 
9.1%
3 1813
 
5.6%
2 1623
 
5.0%
4 1101
 
3.4%
8 834
 
2.6%
7 790
 
2.4%
9 768
 
2.4%
Other values (2) 1295
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32538
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11164
34.3%
: 5916
18.2%
1 4276
 
13.1%
~ 2958
 
9.1%
3 1813
 
5.6%
2 1623
 
5.0%
4 1101
 
3.4%
8 834
 
2.6%
7 790
 
2.4%
9 768
 
2.4%
Other values (2) 1295
 
4.0%

정원
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct32
Distinct (%)1.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean4.5380453
Minimum1
Maximum900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.1 KiB
2023-12-13T07:19:10.535907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile15
Maximum900
Range899
Interquartile range (IQR)1

Descriptive statistics

Standard deviation29.946552
Coefficient of variation (CV)6.5989981
Kurtosis810.23981
Mean4.5380453
Median Absolute Deviation (MAD)0
Skewness27.281932
Sum13419
Variance896.796
MonotonicityNot monotonic
2023-12-13T07:19:10.637970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
1 1844
62.3%
2 722
 
24.4%
8 78
 
2.6%
5 58
 
2.0%
6 31
 
1.0%
40 30
 
1.0%
15 29
 
1.0%
60 25
 
0.8%
4 21
 
0.7%
20 18
 
0.6%
Other values (22) 101
 
3.4%
ValueCountFrequency (%)
1 1844
62.3%
2 722
 
24.4%
3 2
 
0.1%
4 21
 
0.7%
5 58
 
2.0%
6 31
 
1.0%
7 15
 
0.5%
8 78
 
2.6%
9 1
 
< 0.1%
10 9
 
0.3%
ValueCountFrequency (%)
900 3
 
0.1%
70 7
 
0.2%
69 1
 
< 0.1%
60 25
0.8%
59 2
 
0.1%
55 4
 
0.1%
54 1
 
< 0.1%
50 6
 
0.2%
49 1
 
< 0.1%
47 2
 
0.1%

요일
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct19
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
토,일
1305 
월,화,목,금
1083 
165 
월,수,금
 
88
화,목
 
76
Other values (14)
241 

Length

Max length13
Median length11
Mean length4.7822853
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row월,화,목,금,토
2nd row월,화,목,금,토
3rd row월,화,목,금,토
4th row월,화,목,금,토
5th row월,화,목,금,토

Common Values

ValueCountFrequency (%)
토,일 1305
44.1%
월,화,목,금 1083
36.6%
165
 
5.6%
월,수,금 88
 
3.0%
화,목 76
 
2.6%
월,화,수,목,금,토,일 54
 
1.8%
월,화,수,목,금 51
 
1.7%
월,화,수,목,금,토 33
 
1.1%
25
 
0.8%
화,목,토 22
 
0.7%
Other values (9) 56
 
1.9%

Length

2023-12-13T07:19:10.744757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
토,일 1305
44.1%
월,화,목,금 1083
36.6%
165
 
5.6%
월,수,금 88
 
3.0%
화,목 76
 
2.6%
월,화,수,목,금,토,일 54
 
1.8%
월,화,수,목,금 51
 
1.7%
월,화,수,목,금,토 33
 
1.1%
25
 
0.8%
화,목,토 22
 
0.7%
Other values (9) 56
 
1.9%

강습반
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
<NA>
2673 
성인남여
 
236
청소년
 
30
전체
 
17
어린이/성인
 
1

Length

Max length8
Median length4
Mean length3.9803922
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2673
90.4%
성인남여 236
 
8.0%
청소년 30
 
1.0%
전체 17
 
0.6%
어린이/성인 1
 
< 0.1%
성인남여및청소년 1
 
< 0.1%

Length

2023-12-13T07:19:10.852620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:19:10.960892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2673
90.4%
성인남여 236
 
8.0%
청소년 30
 
1.0%
전체 17
 
0.6%
어린이/성인 1
 
< 0.1%
성인남여및청소년 1
 
< 0.1%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct55
Distinct (%)1.9%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean172083.64
Minimum10000
Maximum528000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.1 KiB
2023-12-13T07:19:11.080884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000
5-th percentile70000
Q1145000
median175000
Q3200000
95-th percentile255000
Maximum528000
Range518000
Interquartile range (IQR)55000

Descriptive statistics

Standard deviation51790.951
Coefficient of variation (CV)0.30096382
Kurtosis1.7920414
Mean172083.64
Median Absolute Deviation (MAD)30000
Skewness-0.029341893
Sum5.0885133 × 108
Variance2.6823026 × 109
MonotonicityNot monotonic
2023-12-13T07:19:11.221790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
195000 414
14.0%
215000 333
11.3%
165000 271
 
9.2%
255000 192
 
6.5%
189000 190
 
6.4%
145000 179
 
6.1%
169000 178
 
6.0%
235000 177
 
6.0%
175000 143
 
4.8%
147000 102
 
3.4%
Other values (45) 778
26.3%
ValueCountFrequency (%)
10000 1
 
< 0.1%
30000 4
 
0.1%
33000 1
 
< 0.1%
40000 8
 
0.3%
45000 20
0.7%
50000 25
0.8%
60000 32
1.1%
65000 45
1.5%
67500 1
 
< 0.1%
70000 32
1.1%
ValueCountFrequency (%)
528000 1
 
< 0.1%
480000 1
 
< 0.1%
478000 1
 
< 0.1%
423000 1
 
< 0.1%
396000 1
 
< 0.1%
390000 1
 
< 0.1%
350000 5
0.2%
335000 1
 
< 0.1%
330000 1
 
< 0.1%
308000 1
 
< 0.1%

기간
Text

Distinct61
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
2023-12-13T07:19:11.403993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters62118
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.7%

Sample

1st row2023-01-01~9999-12-31
2nd row2023-01-01~9999-12-31
3rd row2023-01-01~9999-12-31
4th row2023-01-01~9999-12-31
5th row2023-01-01~9999-12-31
ValueCountFrequency (%)
2022-08-01~9999-12-31 792
26.8%
2020-01-01~2022-07-31 307
 
10.4%
2022-07-01~2022-07-31 266
 
9.0%
2022-02-01~2022-07-31 257
 
8.7%
2023-01-01~9999-12-31 202
 
6.8%
2023-07-01~9999-12-31 167
 
5.6%
2020-08-01~2022-07-31 100
 
3.4%
2022-08-01~2023-06-30 86
 
2.9%
2022-11-01~2023-06-30 80
 
2.7%
2022-08-01~2022-10-31 70
 
2.4%
Other values (51) 631
21.3%
2023-12-13T07:19:11.695336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 13830
22.3%
0 12467
20.1%
- 11832
19.0%
1 8215
13.2%
9 5692
9.2%
3 3892
 
6.3%
~ 2958
 
4.8%
7 1458
 
2.3%
8 1263
 
2.0%
6 260
 
0.4%
Other values (2) 251
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 47328
76.2%
Dash Punctuation 11832
 
19.0%
Math Symbol 2958
 
4.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 13830
29.2%
0 12467
26.3%
1 8215
17.4%
9 5692
12.0%
3 3892
 
8.2%
7 1458
 
3.1%
8 1263
 
2.7%
6 260
 
0.5%
5 177
 
0.4%
4 74
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 11832
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2958
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 62118
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 13830
22.3%
0 12467
20.1%
- 11832
19.0%
1 8215
13.2%
9 5692
9.2%
3 3892
 
6.3%
~ 2958
 
4.8%
7 1458
 
2.3%
8 1263
 
2.0%
6 260
 
0.4%
Other values (2) 251
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62118
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 13830
22.3%
0 12467
20.1%
- 11832
19.0%
1 8215
13.2%
9 5692
9.2%
3 3892
 
6.3%
~ 2958
 
4.8%
7 1458
 
2.3%
8 1263
 
2.0%
6 260
 
0.4%
Other values (2) 251
 
0.4%

시작일자
Categorical

HIGH CORRELATION 

Distinct44
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size23.2 KiB
2022-08-01
1077 
2020-01-01
333 
2022-07-01
266 
2022-02-01
257 
2023-01-01
234 
Other values (39)
791 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique11 ?
Unique (%)0.4%

Sample

1st row2023-01-01
2nd row2023-01-01
3rd row2023-01-01
4th row2023-01-01
5th row2023-01-01

Common Values

ValueCountFrequency (%)
2022-08-01 1077
36.4%
2020-01-01 333
 
11.3%
2022-07-01 266
 
9.0%
2022-02-01 257
 
8.7%
2023-01-01 234
 
7.9%
2023-07-01 167
 
5.6%
2020-08-01 100
 
3.4%
2022-11-01 80
 
2.7%
2023-03-01 67
 
2.3%
2023-05-01 67
 
2.3%
Other values (34) 310
 
10.5%

Length

2023-12-13T07:19:11.816691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-08-01 1077
36.4%
2020-01-01 333
 
11.3%
2022-07-01 266
 
9.0%
2022-02-01 257
 
8.7%
2023-01-01 234
 
7.9%
2023-07-01 167
 
5.6%
2020-08-01 100
 
3.4%
2022-11-01 80
 
2.7%
2023-03-01 67
 
2.3%
2023-05-01 67
 
2.3%
Other values (34) 310
 
10.5%

Interactions

2023-12-13T07:19:06.780357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.478375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.844870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.276663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.897529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.559677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.950209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.428077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:07.017803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.659002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.072246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.552817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:07.120897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:05.753837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.163652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:19:06.670467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:19:11.905895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호강좌번호시작월강좌코드명정원요일강습반금액기간시작일자
일련번호1.0000.2000.5460.3250.0000.5570.6760.3600.5960.545
강좌번호0.2001.0000.7580.6090.0000.5470.0850.4030.8290.759
시작월0.5460.7581.0000.9590.7000.8940.8180.7441.0001.000
강좌코드명0.3250.6090.9591.0000.9630.9740.9590.8160.9750.961
정원0.0000.0000.7000.9631.0000.249NaN0.0770.8350.669
요일0.5570.5470.8940.9740.2491.0000.9160.7340.9300.904
강습반0.6760.0850.8180.959NaN0.9161.0000.8800.8560.818
금액0.3600.4030.7440.8160.0770.7340.8801.0000.7710.745
기간0.5960.8291.0000.9750.8350.9300.8560.7711.0001.000
시작일자0.5450.7591.0000.9610.6690.9040.8180.7451.0001.000
2023-12-13T07:19:12.019280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강좌코드명시작월요일강습반시작일자
강좌코드명1.0000.5270.6490.7280.529
시작월0.5271.0000.4600.5311.000
요일0.6490.4601.0000.6290.477
강습반0.7280.5310.6291.0000.531
시작일자0.5291.0000.4770.5311.000
2023-12-13T07:19:12.105521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호강좌번호정원금액시작월강좌코드명요일강습반시작일자
일련번호1.000-0.3390.459-0.4560.4340.2570.4950.8040.433
강좌번호-0.3391.000-0.1280.1230.3890.2710.2490.0570.389
정원0.459-0.1281.000-0.5780.5660.8590.2201.0000.566
금액-0.4560.123-0.5781.0000.3750.4600.3940.5580.374
시작월0.4340.3890.5660.3751.0000.5270.4600.5311.000
강좌코드명0.2570.2710.8590.4600.5271.0000.6490.7280.529
요일0.4950.2490.2200.3940.4600.6491.0000.6290.477
강습반0.8040.0571.0000.5580.5310.7280.6291.0000.531
시작일자0.4330.3890.5660.3741.0000.5290.4770.5311.000

Missing values

2023-12-13T07:19:07.253358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:19:07.452301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:19:07.587816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

일련번호강좌번호시작월강좌코드명강좌명시작종료시간정원요일강습반금액기간시작일자
01009313202023-01-01검도검도 1800-1930 (월화목금) 청소년18:00~19:3010월,화,목,금,토<NA>700002023-01-01~9999-12-312023-01-01
11009312242023-01-01검도검도 1800-1930 (월화목금) 성인18:00~19:3030월,화,목,금,토<NA>800002023-01-01~9999-12-312023-01-01
21007349612023-01-01검도검도 1930-2100 (월화목금) 성인19:30~21:0060월,화,목,금,토<NA>800002023-01-01~9999-12-312023-01-01
324841462023-01-01검도검도 0600-0730 (월화목금) 성인06:00~07:3030월,화,목,금,토<NA>800002023-01-01~9999-12-312023-01-01
424838442023-01-01검도검도 0600-0730 (월화목금) 청소년06:00~07:3010월,화,목,금,토<NA>700002023-01-01~9999-12-312023-01-01
51013528472023-11-01골프23년 골프 월자유 06:00~21:5006:00~21:5047월,화,수,목,금,토,일성인남여1450002023-11-01~9999-12-312023-11-01
61013528402023-10-01골프23년10월 골프 월자유 06:00~21:5006:00~21:5047월,화,수,목,금,토,일성인남여1398302023-10-01~2023-10-312023-10-01
71013528352023-01-01골프23년 골프 월자유 06:00~21:5006:00~21:5046월,화,수,목,금,토,일성인남여1450002023-01-01~2023-09-302023-01-01
8101183282023-01-01골프특강 12회 19~20시 김상우19:00~20:008월,화,수,목,금,토성인남여3500002023-01-01~9999-12-312023-01-01
9101183192023-01-01골프특강 8회 19~20시 김상우19:00~20:0011월,화,수,목,금,토성인남여2500002023-01-01~9999-12-312023-01-01
일련번호강좌번호시작월강좌코드명강좌명시작종료시간정원요일강습반금액기간시작일자
2948101653962023-06-01로드 사이클5.그룹 패키지(입문4회+그룹4회)10:00~21:502월,화,수,목,금,토,일<NA>4230002023-06-01~9999-12-312023-06-01
2949101653862023-06-01로드 사이클4.입문 패키지(입문3회+그룹8회)10:00~21:501월,화,수,목,금,토,일<NA>3900002023-06-01~9999-12-312023-06-01
2950101653762023-06-01로드 사이클4.입문 패키지(입문3회+그룹4회)10:00~21:502월,화,수,목,금,토,일<NA>3350002023-06-01~9999-12-312023-06-01
29511016536102023-06-01로드 사이클2.그룹 12회10:00~21:506월,화,수,목,금,토,일<NA>2640002023-06-01~9999-12-312023-06-01
29521016535112023-06-01로드 사이클2.그룹 8회10:00~21:507월,화,수,목,금,토,일<NA>2200002023-06-01~9999-12-312023-06-01
29531016534112023-06-01로드 사이클2.그룹 4회10:00~21:506월,화,수,목,금,토,일<NA>1650002023-06-01~9999-12-312023-06-01
2954101653172023-06-01로드 사이클3.단기속성 8회10:00~21:501월,화,수,목,금,토,일<NA>5280002023-06-01~9999-12-312023-06-01
2955101653072023-06-01로드 사이클3.단기속성 4회10:00~21:502월,화,수,목,금,토,일<NA>3080002023-06-01~9999-12-312023-06-01
2956101652972023-06-01로드 사이클3.단기속성 1회10:00~21:502월,화,수,목,금,토,일<NA>1100002023-06-01~9999-12-312023-06-01
2957101652882023-06-01로드 사이클1.입문자10:00~21:5010월,화,수,목,금,토,일<NA>2200002023-06-01~9999-12-312023-06-01