Overview

Dataset statistics

Number of variables7
Number of observations1158
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory65.7 KiB
Average record size in memory58.1 B

Variable types

DateTime2
Categorical3
Numeric2

Dataset

Description경기도 수원시 박물관 이용 매표 현황에 대한 데이터로 해당연원, 시설구분, 권종, 상세권종, 이용객수, 매표수익에 대한 정보를 포함합니다.
Author경기도 수원시
URLhttps://www.data.go.kr/data/15102514/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
권종 is highly overall correlated with 상세권종High correlation
상세권종 is highly overall correlated with 권종High correlation
권종 is highly imbalanced (51.5%)Imbalance
이용객수 has 21 (1.8%) zerosZeros
매표수익 has 960 (82.9%) zerosZeros

Reproduction

Analysis started2024-03-30 08:53:40.441363
Analysis finished2024-03-30 08:53:43.203982
Duration2.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Date

Distinct49
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
Minimum2020-01-01 00:00:00
Maximum2024-02-01 00:00:00
2024-03-30T08:53:43.436793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T08:53:43.947617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)

시설구분
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
수원화성박물관
447 
수원박물관
365 
수원광교박물관
346 

Length

Max length7
Median length7
Mean length6.3696028
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원박물관
2nd row수원박물관
3rd row수원박물관
4th row수원박물관
5th row수원박물관

Common Values

ValueCountFrequency (%)
수원화성박물관 447
38.6%
수원박물관 365
31.5%
수원광교박물관 346
29.9%

Length

2024-03-30T08:53:44.472846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T08:53:44.816982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수원화성박물관 447
38.6%
수원박물관 365
31.5%
수원광교박물관 346
29.9%

권종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
일반무료
837 
일반유료
165 
교육참가자
85 
통합유료
 
34
통합무료
 
23
Other values (2)
 
14

Length

Max length5
Median length4
Mean length4.0544041
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반유료
2nd row일반유료
3rd row일반유료
4th row일반유료
5th row일반유료

Common Values

ValueCountFrequency (%)
일반무료 837
72.3%
일반유료 165
 
14.2%
교육참가자 85
 
7.3%
통합유료 34
 
2.9%
통합무료 23
 
2.0%
기타 12
 
1.0%
체험지유료 2
 
0.2%

Length

2024-03-30T08:53:45.247749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T08:53:45.621362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반무료 837
72.3%
일반유료 165
 
14.2%
교육참가자 85
 
7.3%
통합유료 34
 
2.9%
통합무료 23
 
2.0%
기타 12
 
1.0%
체험지유료 2
 
0.2%

상세권종
Categorical

HIGH CORRELATION 

Distinct41
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
어른
201 
청소년+군인
168 
어린이
152 
경로
143 
장애인
134 
Other values (36)
360 

Length

Max length14
Median length11
Mean length4.201209
Min length1

Unique

Unique10 ?
Unique (%)0.9%

Sample

1st row어른
2nd row청소년+군인
3rd row어른자원봉사자
4th row어른단체
5th row청소년+군인단체

Common Values

ValueCountFrequency (%)
어른 201
17.4%
청소년+군인 168
14.5%
어린이 152
13.1%
경로 143
12.3%
장애인 134
11.6%
교육참가자 70
 
6.0%
국가유공자 42
 
3.6%
어린이체험실 38
 
3.3%
수원시민 성인 30
 
2.6%
대관인원 20
 
1.7%
Other values (31) 160
13.8%

Length

2024-03-30T08:53:46.116082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어른 207
16.7%
청소년+군인 177
14.3%
어린이 152
12.3%
경로 143
11.6%
장애인 134
10.8%
수원시민 71
 
5.7%
교육참가자 70
 
5.7%
국가유공자 42
 
3.4%
어린이체험실 38
 
3.1%
성인 30
 
2.4%
Other values (32) 172
13.9%

이용객수
Real number (ℝ)

ZEROS 

Distinct620
Distinct (%)53.6%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean575.00259
Minimum0
Maximum13409
Zeros21
Zeros (%)1.8%
Negative0
Negative (%)0.0%
Memory size10.3 KiB
2024-03-30T08:53:46.650602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q123
median144
Q3654
95-th percentile2310
Maximum13409
Range13409
Interquartile range (IQR)631

Descriptive statistics

Standard deviation1098.2976
Coefficient of variation (CV)1.9100742
Kurtosis29.437487
Mean575.00259
Median Absolute Deviation (MAD)141
Skewness4.4236835
Sum665278
Variance1206257.6
MonotonicityNot monotonic
2024-03-30T08:53:47.195559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 50
 
4.3%
2 30
 
2.6%
3 25
 
2.2%
0 21
 
1.8%
5 16
 
1.4%
7 14
 
1.2%
15 14
 
1.2%
20 13
 
1.1%
4 12
 
1.0%
10 10
 
0.9%
Other values (610) 952
82.2%
ValueCountFrequency (%)
0 21
1.8%
1 50
4.3%
2 30
2.6%
3 25
2.2%
4 12
 
1.0%
5 16
 
1.4%
6 8
 
0.7%
7 14
 
1.2%
8 8
 
0.7%
9 8
 
0.7%
ValueCountFrequency (%)
13409 1
0.1%
8563 1
0.1%
8365 1
0.1%
7757 1
0.1%
7736 1
0.1%
7185 1
0.1%
7157 1
0.1%
6830 1
0.1%
6491 1
0.1%
6480 1
0.1%

매표수익
Real number (ℝ)

ZEROS 

Distinct141
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55307.945
Minimum0
Maximum5384000
Zeros960
Zeros (%)82.9%
Negative0
Negative (%)0.0%
Memory size10.3 KiB
2024-03-30T08:53:47.641917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile114750
Maximum5384000
Range5384000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation376836.14
Coefficient of variation (CV)6.8134178
Kurtosis101.97521
Mean55307.945
Median Absolute Deviation (MAD)0
Skewness9.6887643
Sum64046600
Variance1.4200547 × 1011
MonotonicityNot monotonic
2024-03-30T08:53:48.168489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 960
82.9%
1000 11
 
0.9%
2000 9
 
0.8%
6000 6
 
0.5%
21000 4
 
0.3%
8000 4
 
0.3%
500 3
 
0.3%
750 3
 
0.3%
5250 3
 
0.3%
10500 3
 
0.3%
Other values (131) 152
 
13.1%
ValueCountFrequency (%)
0 960
82.9%
500 3
 
0.3%
750 3
 
0.3%
800 1
 
0.1%
1000 11
 
0.9%
1200 1
 
0.1%
1500 2
 
0.2%
2000 9
 
0.8%
2250 1
 
0.1%
2500 1
 
0.1%
ValueCountFrequency (%)
5384000 1
0.1%
4428000 1
0.1%
4198000 1
0.1%
4114000 1
0.1%
3702000 1
0.1%
3468000 1
0.1%
3240000 1
0.1%
3204000 1
0.1%
2820000 1
0.1%
2804000 1
0.1%

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
Minimum2024-03-20 00:00:00
Maximum2024-03-20 00:00:00
2024-03-30T08:53:48.476366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T08:53:48.735866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-30T08:53:41.626646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T08:53:41.028009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T08:53:41.918489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T08:53:41.308259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-30T08:53:48.976079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월시설구분권종상세권종이용객수매표수익
연월1.0000.0000.4600.0000.0000.000
시설구분0.0001.0000.3000.5010.2120.078
권종0.4600.3001.0000.8970.0000.207
상세권종0.0000.5010.8971.0000.2570.000
이용객수0.0000.2120.0000.2571.0000.325
매표수익0.0000.0780.2070.0000.3251.000
2024-03-30T08:53:49.266329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
권종시설구분상세권종
권종1.0000.2110.623
시설구분0.2111.0000.283
상세권종0.6230.2831.000
2024-03-30T08:53:49.531278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이용객수매표수익시설구분권종상세권종
이용객수1.000-0.2430.1370.0000.099
매표수익-0.2431.0000.0490.1120.000
시설구분0.1370.0491.0000.2110.283
권종0.0000.1120.2111.0000.623
상세권종0.0990.0000.2830.6231.000

Missing values

2024-03-30T08:53:42.343717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-30T08:53:42.869031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월시설구분권종상세권종이용객수매표수익데이터 기준일자
02020-01수원박물관일반유료어른70114020002024-03-20
12020-01수원박물관일반유료청소년+군인20200002024-03-20
22020-01수원박물관일반유료어른자원봉사자110002024-03-20
32020-01수원박물관일반유료어른단체1211210002024-03-20
42020-01수원박물관일반유료청소년+군인단체192960002024-03-20
52020-01수원박물관일반무료어른103302024-03-20
62020-01수원박물관일반무료청소년+군인18502024-03-20
72020-01수원박물관일반무료어린이199702024-03-20
82020-01수원박물관일반무료국가유공자1502024-03-20
92020-01수원박물관일반무료장애인31402024-03-20
연월시설구분권종상세권종이용객수매표수익데이터 기준일자
11482024-02수원화성박물관일반무료경로132902024-03-20
11492024-02수원화성박물관일반무료다자녀(2인이상) 수원시민14202024-03-20
11502024-02수원화성박물관일반무료보훈보상대상자302024-03-20
11512024-02수원화성박물관통합유료어른18630002024-03-20
11522024-02수원화성박물관통합무료어른17902024-03-20
11532024-02수원화성박물관통합무료청소년.군인502024-03-20
11542024-02수원화성박물관통합무료어린이1402024-03-20
11552024-02수원화성박물관통합무료경로402024-03-20
11562024-02수원화성박물관교육참가자교육참가자16802024-03-20
11572024-02수원화성박물관교육참가자대관인원50002024-03-20