Overview

Dataset statistics

Number of variables: 5
Number of observations: 1959
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 8
Duplicate rows (%): 0.4%
Total size in memory: 78.6 KiB
Average record size in memory: 41.1 B
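
The percentage and per-record figures above follow directly from the raw counts. As a quick sanity check, the sketch below recomputes them from the reported totals. The variable names are illustrative and not part of the report, and it assumes 1 KiB = 1024 bytes.

```python
# Recompute the derived dataset statistics from the reported raw figures.
# Variable names are illustrative; the numbers are copied from the report.
n_rows = 1959
n_duplicate_rows = 8
total_size_kib = 78.6

# Share of duplicate rows, in percent.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both values round to the figures shown in the report (0.4% and 41.1 B).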

Variable types

DateTime: 2
Numeric: 1
Text: 1
Categorical: 1

Dataset

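Description: Information on applications for cultural tourism guides (문화관광해설사) made through the tourism website of Yeosu City, Jeollanam-do.
Author: Yeosu City, Jeollanam-do (전라남도 여수시)
URL: https://www.data.go.kr/data/15040847/fileData.do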

Alerts

Dataset has 8 (0.4%) duplicate rows [Duplicates]
Cultural tourism guide application number (tourist_idx) has 1721 (87.9%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 23:28:52.033771
Analysis finished: 2023-12-12 23:28:52.588888
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Cultural tourism guide application date (tourist_date)

Distinct: 279
Distinct (%): 14.2%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
Minimum: 2016-07-01 00:00:00
Maximum: 2023-09-01 00:00:00
Histogram with fixed size bins (bins=50)

Cultural tourism guide application number (tourist_idx)

Distinct: 239
Distinct (%): 12.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92.266973
Minimum: 0
Maximum: 1030
Zeros: 1721
Zeros (%): 87.9%
Negative: 0
Negative (%): 0.0%
Memory size: 17.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 772.1
Maximum: 1030
Range: 1030
Interquartile range (IQR): 0
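
Because 87.9% of the values are zero, every quantile up to Q3 is 0 and only the upper tail is non-zero. The sketch below illustrates this with a hypothetical zero-inflated sample (not the real data) and a hand-written, linearly interpolated percentile. The report does not state which interpolation rule the profiler uses, so this rule is an assumption.

```python
import math


def percentile(values, q):
    """Linearly interpolated percentile for q in [0, 1] (assumed rule)."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)      # fractional index into the sorted data
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return ordered[lo] + frac * (ordered[hi] - ordered[lo])


# Hypothetical zero-inflated sample: 88 zeros and 12 non-zero values.
values = [0] * 88 + [600 + 10 * i for i in range(12)]

q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
iqr = q3 - q1

print(q1, median, q3, iqr, p95)
```

With this toy sample Q1, the median, Q3 and the IQR are all 0 while the 95th percentile is large, the same pattern the report shows for tourist_idx.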

Descriptive statistics

Standard deviation: 249.8964
Coefficient of variation (CV): 2.7084057
Kurtosis: 3.8345564
Mean: 92.266973
Median Absolute Deviation (MAD): 0
Skewness: 2.3845708
Sum: 180751
Variance: 62448.209
Monotonicity: Not monotonic
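
Several of these statistics are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean). The sketch below checks that the reported figures are mutually consistent. The names and tolerance are illustrative choices, and the numbers are copied from the report.

```python
import math

# Figures copied from the report for tourist_idx.
n = 1959
total = 180751
mean = 92.266973
std = 249.8964
variance = 62448.209
cv = 2.7084057
zeros = 1721

# Each check compares a reported value with the value implied by the others.
checks = {
    "mean == sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "zeros (%) == 87.9": round(100 * zeros / n, 1) == 87.9,
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```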
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
0                    1721  87.9%
633                     1  0.1%
789                     1  0.1%
788                     1  0.1%
787                     1  0.1%
791                     1  0.1%
793                     1  0.1%
792                     1  0.1%
795                     1  0.1%
796                     1  0.1%
Other values (229)    229  11.7%

Minimum 10 values

Value  Count  Frequency (%)
0       1721  87.9%
619        1  0.1%
631        1  0.1%
633        1  0.1%
635        1  0.1%
636        1  0.1%
637        1  0.1%
638        1  0.1%
639        1  0.1%
640        1  0.1%

Maximum 10 values

Value  Count  Frequency (%)
1030       1  0.1%
1027       1  0.1%
1009       1  0.1%
996        1  0.1%
976        1  0.1%
971        1  0.1%
967        1  0.1%
964        1  0.1%
945        1  0.1%
944        1  0.1%

Cultural tourism guide application registration date (reg_date)

Distinct: 1832
Distinct (%): 93.5%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
Minimum: 2016-07-20 14:31:00
Maximum: 2023-08-21 20:50:00
Histogram with fixed size bins (bins=50)

Cultural tourism guide applying group name (tourist_title)

Distinct: 1616
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB

Length

Max length: 50
Median length: 29
Mean length: 8.5860133
Min length: 2

Characters and Unicode

Total characters: 16820
Distinct characters: 560
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
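
As an illustration of these per-code-point properties, the sketch below counts Unicode general categories for one title from the Sample section using Python's standard unicodedata module. The mapping from two-letter codes to the long names used in this report is written out by hand for the categories that occur here; the function name is illustrative.

```python
import unicodedata
from collections import Counter

# Long names as they appear in the report, keyed by the two-letter
# general-category codes returned by unicodedata.category().
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(text):
    """Count characters in `text` by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


title = "남도한바퀴(금오도)"  # 4th sample row of tourist_title
counts = category_counts(title)

for code, count in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

For this title the eight Hangul syllables fall under Other Letter and the two parentheses under Open and Close Punctuation. Script and block lookups are not available in the standard library and are left out of the sketch.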

Unique

Unique: 1449
Unique (%): 74.0%

Sample

1st row: 부산시 교통행정과 벤치마킹팀 50명
2nd row: 진달레축제
3rd row: 디즈니랜드 인형 행사
4th row: 남도한바퀴(금오도)
5th row: 부산판례연구회

Value                Count  Frequency (%)
남도한바퀴              62  2.2%
산단투어                59  2.1%
팸투어                  32  1.1%
마이스                  31  1.1%
투어                    24  0.9%
산단                    21  0.7%
촬영                    18  0.6%
거문도                  15  0.5%
해설                    14  0.5%
홍보부스                14  0.5%
Other values (1894)   2517  89.7%
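
The word table counts tokens rather than whole titles (its counts sum to 2807 words across 1959 rows). A minimal sketch of such a count follows. The tokenization rule (replace punctuation with spaces, then split on whitespace) is an assumption and may differ from what the profiler actually does; the titles are taken from elsewhere in this report.

```python
import unicodedata
from collections import Counter


def tokenize(title):
    """Split a title into words, treating punctuation as whitespace (assumed rule)."""
    cleaned = "".join(
        " " if unicodedata.category(ch).startswith("P") else ch
        for ch in title
    )
    return cleaned.split()


# A few tourist_title values that appear in this report.
titles = [
    "남도한바퀴(금오도)",
    "남도한바퀴",
    "팸투어(야간관광 특화도시)",
    "남도한바퀴 금오도",
]

word_counts = Counter(word for title in titles for word in tokenize(title))
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / total_words:.1f}%")
```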

Most occurring characters

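Value               Count  Frequency (%)
(space)               890  5.3%
(not preserved)       412  2.4%
(not preserved)       399  2.4%
(not preserved)       336  2.0%
(not preserved)       318  1.9%
(not preserved)       301  1.8%
(not preserved)       300  1.8%
(not preserved)       279  1.7%
(not preserved)       274  1.6%
(not preserved)       269  1.6%
Other values (550)  13042  77.5%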

Most occurring categories

Value              Count  Frequency (%)
Other Letter       14624  86.9%
Space Separator      890  5.3%
Close Punctuation    304  1.8%
Open Punctuation     301  1.8%
Uppercase Letter     262  1.6%
Decimal Number       234  1.4%
Lowercase Letter     153  0.9%
Other Punctuation     46  0.3%
Dash Punctuation       4  < 0.1%
Math Symbol            2  < 0.1%

Most frequent character per category

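Other Letter

Value               Count  Frequency (%)
(not preserved)       412  2.8%
(not preserved)       399  2.7%
(not preserved)       336  2.3%
(not preserved)       318  2.2%
(not preserved)       301  2.1%
(not preserved)       300  2.1%
(not preserved)       279  1.9%
(not preserved)       274  1.9%
(not preserved)       269  1.8%
(not preserved)       257  1.8%
Other values (488)  11479  78.5%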

Uppercase Letter

Value              Count  Frequency (%)
S                     47  17.9%
B                     41  15.6%
C                     36  13.7%
K                     27  10.3%
E                     19  7.3%
M                     12  4.6%
N                     10  3.8%
P                     10  3.8%
G                      9  3.4%
I                      9  3.4%
Other values (11)     42  16.0%

Lowercase Letter

Value              Count  Frequency (%)
c                     29  19.0%
b                     23  15.0%
k                     17  11.1%
s                     15  9.8%
e                     14  9.2%
t                     12  7.8%
m                     12  7.8%
i                      8  5.2%
a                      5  3.3%
y                      4  2.6%
Other values (6)      14  9.2%

Decimal Number

Value  Count  Frequency (%)
2         54  23.1%
1         53  22.6%
3         44  18.8%
5         25  10.7%
0         23  9.8%
6         16  6.8%
4          8  3.4%
8          7  3.0%
7          3  1.3%
9          1  0.4%

Other Punctuation

Value  Count  Frequency (%)
,         17  37.0%
/         16  34.8%
.         10  21.7%
?          2  4.3%
:          1  2.2%

Close Punctuation

Value            Count  Frequency (%)
)                  198  65.1%
]                  102  33.6%
}                    3  1.0%
(not preserved)      1  0.3%

Open Punctuation

Value            Count  Frequency (%)
(                  192  63.8%
[                  108  35.9%
(not preserved)      1  0.3%

Space Separator

Value    Count  Frequency (%)
(space)    890  100.0%

Dash Punctuation

Value  Count  Frequency (%)
-          4  100.0%

Math Symbol

Value  Count  Frequency (%)
~          2  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  14624  86.9%
Common   1781  10.6%
Latin     415  2.5%

Most frequent character per script

Hangul

Value               Count  Frequency (%)
(not preserved)       412  2.8%
(not preserved)       399  2.7%
(not preserved)       336  2.3%
(not preserved)       318  2.2%
(not preserved)       301  2.1%
(not preserved)       300  2.1%
(not preserved)       279  1.9%
(not preserved)       274  1.9%
(not preserved)       269  1.8%
(not preserved)       257  1.8%
Other values (488)  11479  78.5%

Latin

Value              Count  Frequency (%)
S                     47  11.3%
B                     41  9.9%
C                     36  8.7%
c                     29  7.0%
K                     27  6.5%
b                     23  5.5%
E                     19  4.6%
k                     17  4.1%
s                     15  3.6%
e                     14  3.4%
Other values (27)    147  35.4%

Common

Value              Count  Frequency (%)
(space)              890  50.0%
)                    198  11.1%
(                    192  10.8%
[                    108  6.1%
]                    102  5.7%
2                     54  3.0%
1                     53  3.0%
3                     44  2.5%
5                     25  1.4%
0                     23  1.3%
Other values (15)     92  5.2%

Most occurring blocks

Value        Count  Frequency (%)
Hangul       14620  86.9%
ASCII         2194  13.0%
Compat Jamo      4  < 0.1%
None             2  < 0.1%

Most frequent character per block

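ASCII

Value              Count  Frequency (%)
(space)              890  40.6%
)                    198  9.0%
(                    192  8.8%
[                    108  4.9%
]                    102  4.6%
2                     54  2.5%
1                     53  2.4%
S                     47  2.1%
3                     44  2.0%
B                     41  1.9%
Other values (50)    465  21.2%

Hangul

Value               Count  Frequency (%)
(not preserved)       412  2.8%
(not preserved)       399  2.7%
(not preserved)       336  2.3%
(not preserved)       318  2.2%
(not preserved)       301  2.1%
(not preserved)       300  2.1%
(not preserved)       279  1.9%
(not preserved)       274  1.9%
(not preserved)       269  1.8%
(not preserved)       257  1.8%
Other values (487)  11475  78.5%

Compat Jamo

Value            Count  Frequency (%)
(not preserved)      4  100.0%

None

Value            Count  Frequency (%)
(not preserved)      1  50.0%
(not preserved)      1  50.0%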

Cultural tourism tour type (tourist_cate)

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
일반투어 (general tour): 1738
산단투어 (industrial complex tour): 221

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반투어
2nd row: 일반투어
3rd row: 일반투어
4th row: 일반투어
5th row: 일반투어

Common Values

Value     Count  Frequency (%)
일반투어   1738  88.7%
산단투어    221  11.3%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: category counts; 일반투어 1738 (88.7%), 산단투어 221 (11.3%)]

Interactions

[Figure: interaction plot]

Correlations

[Figure: correlation plot]

              tourist_idx  tourist_cate
tourist_idx         1.000         0.102
tourist_cate        0.102         1.000

[Figure: correlation plot]

              tourist_idx  tourist_cate
tourist_idx         1.000         0.125
tourist_cate        0.125         1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  tourist_date  tourist_idx  reg_date          tourist_title                      tourist_cate
0      2016-07-01    0            2016-08-03 11:54  부산시 교통행정과 벤치마킹팀 50명  일반투어
1      2016-07-01    0            2016-07-20 14:31  진달레축제                         일반투어
2      2016-08-01    0            2016-07-20 15:25  디즈니랜드 인형 행사               일반투어
3      2016-08-01    0            2016-07-22 14:55  남도한바퀴(금오도)                 일반투어
4      2016-08-05    0            2016-08-03 11:54  부산판례연구회                     일반투어
5      2016-08-01    0            2016-07-22 14:55  금오도(광양)                       일반투어
6      2016-07-01    0            2016-08-03 11:54  부산판례연구회                     일반투어
7      2016-08-01    0            2016-07-22 19:14  부산판례연구회                     일반투어
8      2016-09-01    0            2016-08-12 10:50  테스트                             일반투어
9      2016-08-01    0            2016-07-25 18:20  제15회 한농연 전국대회             일반투어

Last rows

Index  tourist_date  tourist_idx  reg_date          tourist_title                tourist_cate
1949   2023-08-01    0            2023-08-07 19:44  서귀포고등학교               산단투어
1950   2023-08-01    0            2023-08-07 19:44  문화재청현충관리소           일반투어
1951   2023-08-01    0            2023-08-07 19:47  별헤는아이 지역아동센터      일반투어
1952   2023-08-01    0            2023-08-07 19:48  독일민헨유승석태권도         일반투어
1953   2023-08-01    0            2023-08-07 19:51  삼일고등학교                 산단투어
1954   2023-08-01    0            2023-08-14 17:49  문화재 야행                  일반투어
1955   2023-08-01    0            2023-08-14 17:58  다낭회                       일반투어
1956   2023-08-01    0            2023-08-18 17:16  순천대학교                   산단투어
1957   2023-09-01    0            2023-08-21 17:55  남도한바퀴                   일반투어
1958   2023-08-01    0            2023-08-21 20:50  팸투어(야간관광 특화도시)   일반투어
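
The two DateTime columns can be compared row by row. The sketch below computes, for the first three sample rows, how many days tourist_date lies after reg_date, using only the standard library. Reading tourist_date as the requested tour date and reg_date as the time the application was entered is an assumption; the report does not define the columns beyond their names. The function name is illustrative.

```python
from datetime import datetime

# (tourist_date, reg_date) pairs copied from the first three sample rows.
rows = [
    ("2016-07-01", "2016-08-03 11:54"),
    ("2016-07-01", "2016-07-20 14:31"),
    ("2016-08-01", "2016-07-20 15:25"),
]


def lead_time_days(tourist_date, reg_date):
    """Days from registration to the tour date; negative if registered afterwards."""
    tour = datetime.strptime(tourist_date, "%Y-%m-%d")
    reg = datetime.strptime(reg_date, "%Y-%m-%d %H:%M")
    return (tour - reg).total_seconds() / 86400


lead_times = [lead_time_days(tour, reg) for tour, reg in rows]

for (tour, reg), days in zip(rows, lead_times):
    print(f"{tour} vs {reg}: {days:+.1f} days")
```

In the first two rows the registration timestamp is later than tourist_date, so the lead time is negative; in the third row the application was registered roughly 11 days ahead.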

Duplicate rows

Most frequently occurring

Index  tourist_date  tourist_idx  reg_date          tourist_title             tourist_cate  # duplicates
0      2016-11-01    0            2016-10-26 14:44  남도한바퀴 금오도         일반투어      4
1      2016-11-01    0            2016-10-31 15:43  천안시 성결교회 목회자    일반투어      3
3      2018-11-01    0            2018-11-05 14:43  전주한옥마을단체          일반투어      3
6      2022-11-01    0            2022-11-28 13:01  동동북축제 지원           일반투어      3
2      2018-06-01    0            2018-05-13 09:50  정보과학고2               산단투어      2
4      2022-10-01    0            2022-10-25 17:09  산학협력단                산단투어      2
5      2022-11-01    0            2022-11-15 15:08  전라남도행정동우회        일반투어      2
7      2023-04-01    0            2023-03-29 14:03  크루즈입항(이순신광장)    일반투어      2
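
The table lists eight distinct rows that occur more than once, which matches the "Duplicate rows: 8" figure in the overview. A minimal sketch of how such groups can be found with the standard library is shown below. The input list is hypothetical: it repeats two rows from the table as many times as their "# duplicates" value and adds one unique row from the Sample section. The function name is illustrative.

```python
from collections import Counter

row_a = ("2016-11-01", 0, "2016-10-26 14:44", "남도한바퀴 금오도", "일반투어")
row_b = ("2018-06-01", 0, "2018-05-13 09:50", "정보과학고2", "산단투어")
row_c = ("2016-09-01", 0, "2016-08-12 10:50", "테스트", "일반투어")

# Hypothetical input: row_a occurs 4 times, row_b twice, row_c once.
rows = [row_a] * 4 + [row_b] * 2 + [row_c]


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


groups = duplicate_groups(rows)

# Sort by occurrence count, most frequent first, as in the report's table.
for row, n in sorted(groups.items(), key=lambda item: -item[1]):
    print(n, row)
```

Rows are compared as whole tuples, so two records count as duplicates only when every column matches.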