Overview

Dataset statistics

Number of variables5
Number of observations594
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.1 KiB
Average record size in memory43.2 B

Variable types

Text1
Numeric1
Categorical3

Dataset

Description예술의전당 대관별 부대설비(장비) 사용내역 입니다. 관련 데이터 : 대관명, 사용일자, 장비명, 사용수량, 단가 데이터 기간 : 2016. 2.~2022. 9.
Author예술의전당
URLhttps://www.data.go.kr/data/15106934/fileData.do

Alerts

사용수량 has constant value ""Constant
장비명 is highly overall correlated with 사용일자 and 1 other fieldsHigh correlation
단가 is highly overall correlated with 장비명High correlation
사용일자 is highly overall correlated with 장비명High correlation

Reproduction

Analysis started2024-04-21 14:40:17.100630
Analysis finished2024-04-21 14:40:18.333316
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct489
Distinct (%)82.3%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-04-21T23:40:19.225521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length44
Mean length21.205387
Min length3

Characters and Unicode

Total characters12596
Distinct characters512
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique394 ?
Unique (%)66.3%

Sample

1st row도밍고 힌도얀의 영웅의 생애
2nd row바이올리니스트 김지연의 발렌타인 프로포즈
3rd row2016 발렌타인데이 콘서트
4th rowKBS교향악단 제703회 정기연주회
5th row리처드 용재 오닐 <My Way>
ValueCountFrequency (%)
정기연주회 195
 
8.0%
kbs교향악단 128
 
5.2%
73
 
3.0%
리사이틀 35
 
1.4%
피아노 33
 
1.4%
코리안심포니오케스트라 26
 
1.1%
오케스트라 26
 
1.1%
콘서트 24
 
1.0%
국립합창단 23
 
0.9%
시리즈 21
 
0.9%
Other values (982) 1858
76.1%
2024-04-21T23:40:20.782251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1850
 
14.7%
500
 
4.0%
297
 
2.4%
284
 
2.3%
268
 
2.1%
265
 
2.1%
245
 
1.9%
243
 
1.9%
211
 
1.7%
206
 
1.6%
Other values (502) 8227
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8121
64.5%
Space Separator 1850
 
14.7%
Decimal Number 1086
 
8.6%
Uppercase Letter 686
 
5.4%
Lowercase Letter 541
 
4.3%
Other Punctuation 106
 
0.8%
Close Punctuation 44
 
0.3%
Open Punctuation 43
 
0.3%
Dash Punctuation 41
 
0.3%
Math Symbol 21
 
0.2%
Other values (5) 57
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
500
 
6.2%
297
 
3.7%
284
 
3.5%
268
 
3.3%
265
 
3.3%
245
 
3.0%
243
 
3.0%
211
 
2.6%
206
 
2.5%
190
 
2.3%
Other values (409) 5412
66.6%
Uppercase Letter
ValueCountFrequency (%)
K 152
22.2%
S 150
21.9%
B 138
20.1%
I 42
 
6.1%
A 31
 
4.5%
O 27
 
3.9%
R 26
 
3.8%
C 20
 
2.9%
E 14
 
2.0%
M 13
 
1.9%
Other values (14) 73
10.6%
Lowercase Letter
ValueCountFrequency (%)
a 67
12.4%
e 60
11.1%
r 58
10.7%
t 41
 
7.6%
h 39
 
7.2%
c 37
 
6.8%
o 33
 
6.1%
s 32
 
5.9%
i 30
 
5.5%
n 26
 
4.8%
Other values (13) 118
21.8%
Other Punctuation
ValueCountFrequency (%)
: 45
42.5%
, 14
 
13.2%
& 12
 
11.3%
12
 
11.3%
. 9
 
8.5%
! 4
 
3.8%
4
 
3.8%
' 2
 
1.9%
? 1
 
0.9%
· 1
 
0.9%
Other values (2) 2
 
1.9%
Decimal Number
ValueCountFrequency (%)
2 193
17.8%
1 191
17.6%
7 172
15.8%
0 169
15.6%
3 81
7.5%
6 70
 
6.4%
8 65
 
6.0%
9 53
 
4.9%
4 52
 
4.8%
5 40
 
3.7%
Letter Number
ValueCountFrequency (%)
7
35.0%
6
30.0%
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 35
79.5%
5
 
11.4%
] 4
 
9.1%
Open Punctuation
ValueCountFrequency (%)
( 34
79.1%
5
 
11.6%
[ 4
 
9.3%
Math Symbol
ValueCountFrequency (%)
> 9
42.9%
< 9
42.9%
+ 3
 
14.3%
Other Number
ValueCountFrequency (%)
10
55.6%
8
44.4%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
1850
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8119
64.5%
Common 3228
 
25.6%
Latin 1247
 
9.9%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
500
 
6.2%
297
 
3.7%
284
 
3.5%
268
 
3.3%
265
 
3.3%
245
 
3.0%
243
 
3.0%
211
 
2.6%
206
 
2.5%
190
 
2.3%
Other values (407) 5410
66.6%
Latin
ValueCountFrequency (%)
K 152
 
12.2%
S 150
 
12.0%
B 138
 
11.1%
a 67
 
5.4%
e 60
 
4.8%
r 58
 
4.7%
I 42
 
3.4%
t 41
 
3.3%
h 39
 
3.1%
c 37
 
3.0%
Other values (43) 463
37.1%
Common
ValueCountFrequency (%)
1850
57.3%
2 193
 
6.0%
1 191
 
5.9%
7 172
 
5.3%
0 169
 
5.2%
3 81
 
2.5%
6 70
 
2.2%
8 65
 
2.0%
9 53
 
1.6%
4 52
 
1.6%
Other values (30) 332
 
10.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8119
64.5%
ASCII 4404
35.0%
None 27
 
0.2%
Number Forms 20
 
0.2%
Enclosed Alphanum 18
 
0.1%
Punctuation 6
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1850
42.0%
2 193
 
4.4%
1 191
 
4.3%
7 172
 
3.9%
0 169
 
3.8%
K 152
 
3.5%
S 150
 
3.4%
B 138
 
3.1%
3 81
 
1.8%
6 70
 
1.6%
Other values (66) 1238
28.1%
Hangul
ValueCountFrequency (%)
500
 
6.2%
297
 
3.7%
284
 
3.5%
268
 
3.3%
265
 
3.3%
245
 
3.0%
243
 
3.0%
211
 
2.6%
206
 
2.5%
190
 
2.3%
Other values (407) 5410
66.6%
None
ValueCountFrequency (%)
12
44.4%
5
18.5%
5
18.5%
4
 
14.8%
· 1
 
3.7%
Enclosed Alphanum
ValueCountFrequency (%)
10
55.6%
8
44.4%
Number Forms
ValueCountFrequency (%)
7
35.0%
6
30.0%
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
Punctuation
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

사용일자
Real number (ℝ)

HIGH CORRELATION 

Distinct503
Distinct (%)84.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20188197
Minimum20160212
Maximum20220927
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-04-21T23:40:21.194906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20160212
5-th percentile20160518
Q120170710
median20190304
Q320210226
95-th percentile20220508
Maximum20220927
Range60715
Interquartile range (IQR)39516.25

Descriptive statistics

Standard deviation19639.587
Coefficient of variation (CV)0.00097282521
Kurtosis-1.2306264
Mean20188197
Median Absolute Deviation (MAD)19684
Skewness0.14196317
Sum1.1991789 × 1010
Variance3.8571337 × 108
MonotonicityIncreasing
2024-04-21T23:40:21.640232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20190523 2
 
0.3%
20170223 2
 
0.3%
20200502 2
 
0.3%
20210826 2
 
0.3%
20170519 2
 
0.3%
20180630 2
 
0.3%
20210917 2
 
0.3%
20170428 2
 
0.3%
20200130 2
 
0.3%
20180722 2
 
0.3%
Other values (493) 574
96.6%
ValueCountFrequency (%)
20160212 1
0.2%
20160213 1
0.2%
20160214 1
0.2%
20160218 1
0.2%
20160220 1
0.2%
20160224 1
0.2%
20160227 2
0.3%
20160301 1
0.2%
20160302 1
0.2%
20160310 1
0.2%
ValueCountFrequency (%)
20220927 1
0.2%
20220923 1
0.2%
20220922 1
0.2%
20220913 1
0.2%
20220907 1
0.2%
20220901 1
0.2%
20220819 1
0.2%
20220818 1
0.2%
20220812 1
0.2%
20220809 1
0.2%

장비명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
TV
223 
TV (중계케이블 비용)
203 
라디오
94 
인터넷방송
74 

Length

Max length13
Median length5
Mean length6.2912458
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row라디오
2nd rowTV
3rd rowTV
4th rowTV
5th row인터넷방송

Common Values

ValueCountFrequency (%)
TV 223
37.5%
TV (중계케이블 비용) 203
34.2%
라디오 94
15.8%
인터넷방송 74
 
12.5%

Length

2024-04-21T23:40:21.886888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T23:40:22.067848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tv 426
42.6%
중계케이블 203
20.3%
비용 203
20.3%
라디오 94
 
9.4%
인터넷방송 74
 
7.4%

사용수량
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
1
594 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 594
100.0%

Length

2024-04-21T23:40:22.263824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T23:40:22.420742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 594
100.0%

단가
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
660000
426 
550000
94 
440000
74 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row550000
2nd row660000
3rd row660000
4th row660000
5th row440000

Common Values

ValueCountFrequency (%)
660000 426
71.7%
550000 94
 
15.8%
440000 74
 
12.5%

Length

2024-04-21T23:40:22.585833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T23:40:22.764840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
660000 426
71.7%
550000 94
 
15.8%
440000 74
 
12.5%

Interactions

2024-04-21T23:40:17.563720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T23:40:22.889314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용일자장비명단가
사용일자1.0000.8450.338
장비명0.8451.0001.000
단가0.3381.0001.000
2024-04-21T23:40:23.044407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장비명단가
장비명1.0000.999
단가0.9991.000
2024-04-21T23:40:23.189815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용일자장비명단가
사용일자1.0000.5170.227
장비명0.5171.0000.999
단가0.2270.9991.000

Missing values

2024-04-21T23:40:17.907366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T23:40:18.212936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대관명사용일자장비명사용수량단가
0도밍고 힌도얀의 영웅의 생애20160212라디오1550000
1바이올리니스트 김지연의 발렌타인 프로포즈20160213TV1660000
22016 발렌타인데이 콘서트20160214TV1660000
3KBS교향악단 제703회 정기연주회20160218TV1660000
4리처드 용재 오닐 <My Way>20160220인터넷방송1440000
5코리안 팝스 오케스트라 신년음악회20160224TV1660000
6박수길과 함께하는 Canto della Passione20160227TV1660000
7손열음 피아노 리사이틀20160227TV1660000
82016 코리아 오페라 스타스 앙상블 정기연주회20160301TV1660000
9토마스 햄슨 첫 내한공연20160302TV1660000
대관명사용일자장비명사용수량단가
584국립심포니 : DRs Pick Ⅱ- 수수께끼20220809인터넷방송1440000
585국립합창단 기획공연 위대한 합창 시리즈 II - 본 윌리엄스, 바다 교향곡20220812TV (중계케이블 비용)1660000
586아메리칸 솔로이스츠 앙상블과 함께하는 한국가곡의 밤20220818TV (중계케이블 비용)1660000
587코리아남성합창단 제21회 정기연주회20220819TV (중계케이블 비용)1660000
588KBS교향악단 제781회 정기연주회20220901라디오1550000
589과천시립교향악단 제66회 정기연주회20220907TV (중계케이블 비용)1660000
590서울대학교 음악대학 가을콘서트20220913TV (중계케이블 비용)1660000
591강남심포니오케스트라 제 94회 정기연주회20220922TV (중계케이블 비용)1660000
592한경아르떼필하모닉 정기연주회20220923TV (중계케이블 비용)1660000
593슈만과 클라라, 그리고 브람스20220927TV (중계케이블 비용)1660000