Overview

Dataset statistics

Number of variables4
Number of observations185
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory5.9 KiB
Average record size in memory32.7 B

Variable types

Categorical1
DateTime1
Text2

Dataset

Description경상북도 시군별 2021년 5월부터 ~ 2023년 12월까지의 도립교향악단 공연 스케줄 정보를 제공합니다. (공연일자, 장소, 행사내용)
Author경상북도
URLhttps://www.data.go.kr/data/15071110/fileData.do

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-16 04:25:11.889363
Analysis finished2024-03-16 04:25:12.199361
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct34
Distinct (%)18.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
온라인
28 
경산시
16 
경주시
14 
구미시
12 
기타
12 
Other values (29)
103 

Length

Max length4
Median length3
Mean length2.9027027
Min length2

Unique

Unique9 ?
Unique (%)4.9%

Sample

1st row포항시
2nd row포항시
3rd row경주시
4th row경주시
5th row경주시

Common Values

ValueCountFrequency (%)
온라인 28
15.1%
경산시 16
 
8.6%
경주시 14
 
7.6%
구미시 12
 
6.5%
기타 12
 
6.5%
대구 11
 
5.9%
칠곡군 9
 
4.9%
안동시 8
 
4.3%
포항시 7
 
3.8%
영덕군 6
 
3.2%
Other values (24) 62
33.5%

Length

2024-03-16T13:25:12.258096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
온라인 28
15.1%
경산시 16
 
8.6%
경주시 15
 
8.1%
구미시 15
 
8.1%
기타 12
 
6.5%
대구 11
 
5.9%
칠곡군 9
 
4.9%
포항시 9
 
4.9%
안동시 8
 
4.3%
영덕군 6
 
3.2%
Other values (17) 56
30.3%

일자
Date

Distinct175
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2021-01-15 00:00:00
Maximum2023-12-22 00:00:00
2024-03-16T13:25:12.360130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:25:12.461615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

장소
Text

Distinct135
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-03-16T13:25:12.657127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length7.8702703
Min length5

Characters and Unicode

Total characters1456
Distinct characters176
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique119 ?
Unique (%)64.3%

Sample

1st row포항문화예술회관
2nd row포항 양포초등학교
3rd row경주 영지초등학교
4th row경주 문화중학교
5th row경주안강제일초등학교
ValueCountFrequency (%)
유튜브 32
 
12.5%
공연 32
 
12.5%
수성아트피아 5
 
2.0%
천마아트센터 4
 
1.6%
경산 4
 
1.6%
대구콘서트하우스 4
 
1.6%
의성 3
 
1.2%
경주 3
 
1.2%
포항문화예술회관 3
 
1.2%
계명아트센터 2
 
0.8%
Other values (147) 163
63.9%
2024-03-16T13:25:13.022073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
 
6.1%
86
 
5.9%
71
 
4.9%
61
 
4.2%
50
 
3.4%
41
 
2.8%
38
 
2.6%
36
 
2.5%
33
 
2.3%
32
 
2.2%
Other values (166) 919
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1373
94.3%
Space Separator 71
 
4.9%
Close Punctuation 5
 
0.3%
Open Punctuation 5
 
0.3%
Decimal Number 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
6.5%
86
 
6.3%
61
 
4.4%
50
 
3.6%
41
 
3.0%
38
 
2.8%
36
 
2.6%
33
 
2.4%
32
 
2.3%
32
 
2.3%
Other values (161) 875
63.7%
Space Separator
ValueCountFrequency (%)
71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1373
94.3%
Common 83
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
6.5%
86
 
6.3%
61
 
4.4%
50
 
3.6%
41
 
3.0%
38
 
2.8%
36
 
2.6%
33
 
2.4%
32
 
2.3%
32
 
2.3%
Other values (161) 875
63.7%
Common
ValueCountFrequency (%)
71
85.5%
) 5
 
6.0%
( 5
 
6.0%
3 1
 
1.2%
- 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1373
94.3%
ASCII 83
 
5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
89
 
6.5%
86
 
6.3%
61
 
4.4%
50
 
3.6%
41
 
3.0%
38
 
2.8%
36
 
2.6%
33
 
2.4%
32
 
2.3%
32
 
2.3%
Other values (161) 875
63.7%
ASCII
ValueCountFrequency (%)
71
85.5%
) 5
 
6.0%
( 5
 
6.0%
3 1
 
1.2%
- 1
 
1.2%
Distinct103
Distinct (%)55.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-03-16T13:25:13.274762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length44
Mean length18.508108
Min length5

Characters and Unicode

Total characters3424
Distinct characters299
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)50.3%

Sample

1st row제165회 정기연주
2nd row톡톡 클래식 청소년을위한 찾아가는 연주
3rd row톡톡 클래식 청소년을위한 찾아가는 연주
4th row톡톡 클래식 청소년을위한 찾아가는 연주
5th row톡톡 클래식 청소년을위한 찾아가는 연주
ValueCountFrequency (%)
클래식 81
 
12.4%
톡톡 73
 
11.2%
찾아가는청소년음악회 36
 
5.5%
음악회 33
 
5.0%
위한 30
 
4.6%
청소년을 27
 
4.1%
찾아가는 19
 
2.9%
청소년을위한 18
 
2.8%
연주 18
 
2.8%
정기연주회 13
 
2.0%
Other values (219) 306
46.8%
2024-03-16T13:25:13.624032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
480
 
14.0%
146
 
4.3%
128
 
3.7%
103
 
3.0%
94
 
2.7%
92
 
2.7%
89
 
2.6%
89
 
2.6%
82
 
2.4%
82
 
2.4%
Other values (289) 2039
59.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2369
69.2%
Space Separator 480
 
14.0%
Lowercase Letter 234
 
6.8%
Decimal Number 188
 
5.5%
Uppercase Letter 62
 
1.8%
Math Symbol 22
 
0.6%
Other Punctuation 20
 
0.6%
Close Punctuation 16
 
0.5%
Open Punctuation 16
 
0.5%
Dash Punctuation 12
 
0.4%
Other values (2) 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
6.2%
128
 
5.4%
103
 
4.3%
94
 
4.0%
92
 
3.9%
89
 
3.8%
89
 
3.8%
82
 
3.5%
82
 
3.5%
81
 
3.4%
Other values (220) 1383
58.4%
Lowercase Letter
ValueCountFrequency (%)
a 29
12.4%
e 25
 
10.7%
o 20
 
8.5%
i 19
 
8.1%
n 19
 
8.1%
l 13
 
5.6%
r 13
 
5.6%
s 11
 
4.7%
y 11
 
4.7%
h 8
 
3.4%
Other values (13) 66
28.2%
Uppercase Letter
ValueCountFrequency (%)
T 10
16.1%
C 6
 
9.7%
B 5
 
8.1%
O 5
 
8.1%
R 4
 
6.5%
S 4
 
6.5%
A 3
 
4.8%
N 3
 
4.8%
M 3
 
4.8%
L 2
 
3.2%
Other values (11) 17
27.4%
Decimal Number
ValueCountFrequency (%)
2 69
36.7%
0 32
17.0%
1 30
16.0%
3 19
 
10.1%
7 11
 
5.9%
6 9
 
4.8%
4 6
 
3.2%
5 6
 
3.2%
8 3
 
1.6%
9 3
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 11
55.0%
& 3
 
15.0%
! 2
 
10.0%
? 1
 
5.0%
, 1
 
5.0%
; 1
 
5.0%
# 1
 
5.0%
Math Symbol
ValueCountFrequency (%)
< 11
50.0%
> 11
50.0%
Space Separator
ValueCountFrequency (%)
480
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2368
69.2%
Common 759
 
22.2%
Latin 296
 
8.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
6.2%
128
 
5.4%
103
 
4.3%
94
 
4.0%
92
 
3.9%
89
 
3.8%
89
 
3.8%
82
 
3.5%
82
 
3.5%
81
 
3.4%
Other values (219) 1382
58.4%
Latin
ValueCountFrequency (%)
a 29
 
9.8%
e 25
 
8.4%
o 20
 
6.8%
i 19
 
6.4%
n 19
 
6.4%
l 13
 
4.4%
r 13
 
4.4%
s 11
 
3.7%
y 11
 
3.7%
T 10
 
3.4%
Other values (34) 126
42.6%
Common
ValueCountFrequency (%)
480
63.2%
2 69
 
9.1%
0 32
 
4.2%
1 30
 
4.0%
3 19
 
2.5%
) 16
 
2.1%
( 16
 
2.1%
- 12
 
1.6%
. 11
 
1.4%
7 11
 
1.4%
Other values (15) 63
 
8.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2368
69.2%
ASCII 1050
30.7%
Punctuation 5
 
0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
480
45.7%
2 69
 
6.6%
0 32
 
3.0%
1 30
 
2.9%
a 29
 
2.8%
e 25
 
2.4%
o 20
 
1.9%
i 19
 
1.8%
3 19
 
1.8%
n 19
 
1.8%
Other values (57) 308
29.3%
Hangul
ValueCountFrequency (%)
146
 
6.2%
128
 
5.4%
103
 
4.3%
94
 
4.0%
92
 
3.9%
89
 
3.8%
89
 
3.8%
82
 
3.5%
82
 
3.5%
81
 
3.4%
Other values (219) 1382
58.4%
Punctuation
ValueCountFrequency (%)
3
60.0%
2
40.0%
CJK
ValueCountFrequency (%)
1
100.0%

Missing values

2024-03-16T13:25:12.109936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:25:12.173844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명일자장소행사내용
0포항시2021-05-11포항문화예술회관제165회 정기연주
1포항시2021-07-07포항 양포초등학교톡톡 클래식 청소년을위한 찾아가는 연주
2경주시2021-04-30경주 영지초등학교톡톡 클래식 청소년을위한 찾아가는 연주
3경주시2021-05-18경주 문화중학교톡톡 클래식 청소년을위한 찾아가는 연주
4경주시2021-06-09경주안강제일초등학교톡톡 클래식 청소년을위한 찾아가는 연주
5김천시2021-06-04김천조마초등학교톡톡 클래식 청소년을위한 찾아가는 연주
6안동시2021-04-26안동초등학교톡톡 클래식 청소년을위한 찾아가는 연주
7안동시2021-06-08안동녹전초등학교톡톡 클래식 청소년을위한 찾아가는 연주
8안동시2021-09-12안동 하회마을 만송정숲 (특설무대)세계문화유산 축전 음악회
9안동시2021-09-26안동 하회마을 만송정숲 (특설무대)세계문화유산 축전 음악회
시군명일자장소행사내용
175김천시2023-11-10김천중앙초등학교찾아가는청소년음악회 톡톡 클래식
176경주시2023-11-14경주화백컨벤션센터2023바르게살기운동경상북도대회
177울진군2023-11-16울진월송초등학교찾아가는청소년음악회 톡톡 클래식
178안동시2023-11-23안동국제컨벤션센터2023국가유공자나라사랑한마음대회
179구미시2023-11-24구미산동고등학교찾아가는청소년음악회 톡톡 클래식
180성주군2023-11-30성주수륜초등학교찾아가는청소년음악회 톡톡 클래식
181청도군2023-12-06청도동산초등학교찾아가는청소년음악회 톡톡 클래식
182경산시2023-12-11경산 천마아트센터대구경북신공항 국제물류포럼
183안동시2023-12-19안동문화예술의전당안동시 승격 50주년 기념 경북 시?도립 교류공연
184칠곡군2023-12-22칠곡군교육문화회관칠곡군민과 함께하는 adieu 2023! welcome 2024!

Duplicate rows

Most frequently occurring

시군명일자장소행사내용# duplicates
0온라인2023-06-16유튜브 공연2023 도민체전 기념음악회<가수 윤성>4