Overview

Dataset statistics

Number of variables: 7
Number of observations: 1833
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 103.9 KiB
Average record size in memory: 58.1 B

Variable types

Numeric: 2
DateTime: 2
Text: 3

Dataset

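Description: Data on tours of the Chungcheongnam-do provincial government office, covering visit date, visit time, group, applicant, purpose of visit, number of visitors, and activity region.
Author: Chungcheongnam-do (충청남도)
URL: https://www.data.go.kr/data/15063028/fileData.do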

Alerts

일련번호 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 10:41:59.181860
Analysis finished: 2024-03-14 10:42:02.037670
Duration: 2.86 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
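
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the script that produced this report: the helper name `profile_csv`, the CSV path, the report title, and the `cp949` encoding are all assumptions. Only `ProfileReport(df, title=...)` and `to_file()` come from the ydata-profiling API.

```python
def profile_csv(csv_path, out_html="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so that defining the helper does not require the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the downloaded file is CP949-encoded; pass encoding="utf-8" if not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative; the original configuration lives in config.json.
    report = ProfileReport(df, title="Chungcheongnam-do office tours")
    report.to_file(out_html)
    return report
```

Calling `profile_csv("tours.csv")` (a placeholder file name) would write `report.html` next to the script.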

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct: 1833
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1268.7103
Minimum: 1
Maximum: 3213
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 131.6
Q1: 500
Median: 1123
Q3: 1747
95-th percentile: 3092.4
Maximum: 3213
Range: 3212
Interquartile range (IQR): 1247

Descriptive statistics

Standard deviation: 924.19551
Coefficient of variation (CV): 0.72845275
Kurtosis: -0.63908363
Mean: 1268.7103
Median Absolute Deviation (MAD): 624
Skewness: 0.68208109
Sum: 2325546
Variance: 854137.34
Monotonicity: Not monotonic
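
Several of the figures above are derived from one another, which gives a quick way to sanity-check the panel. The sketch below only re-does the arithmetic on the reported (rounded) values; the variable names are illustrative.

```python
import math

n = 1833                      # number of observations
total = 2325546               # Sum
mean = 1268.7103
std = 924.19551
variance = 854137.34
cv = 0.72845275
q1, q3 = 500, 1747
minimum, maximum = 1, 3213

# Each entry compares a reported statistic with the value implied by the others.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range = max - min": maximum - minimum == 3212,
    "iqr = q3 - q1": q3 - q1 == 1247,
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

All five relations hold for the reported values, within rounding.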
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
58 | 1 | 0.1%
1488 | 1 | 0.1%
1486 | 1 | 0.1%
1485 | 1 | 0.1%
1484 | 1 | 0.1%
1483 | 1 | 0.1%
1482 | 1 | 0.1%
1481 | 1 | 0.1%
1480 | 1 | 0.1%
1479 | 1 | 0.1%
Other values (1823) | 1823 | 99.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
5 | 1 | 0.1%
26 | 1 | 0.1%
27 | 1 | 0.1%
31 | 1 | 0.1%
32 | 1 | 0.1%
35 | 1 | 0.1%
41 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
3213 | 1 | 0.1%
3212 | 1 | 0.1%
3211 | 1 | 0.1%
3210 | 1 | 0.1%
3209 | 1 | 0.1%
3208 | 1 | 0.1%
3207 | 1 | 0.1%
3206 | 1 | 0.1%
3205 | 1 | 0.1%
3204 | 1 | 0.1%

방문날짜
DateTime

Distinct: 992
Distinct (%): 54.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB
Minimum: 2013-01-08 00:00:00
Maximum: 2023-12-18 00:00:00
Histogram with fixed size bins (bins=50)

방문시간
DateTime

Distinct: 69
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB
Minimum: 2024-03-14 08:40:00
Maximum: 2024-03-14 17:45:00
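
The minimum and maximum both carry the date 2024-03-14, which is the day the analysis ran rather than a visit date. This suggests the column holds times of day only (the sample rows show values such as 10:00) and that a date was filled in during parsing. Under that assumption, only the time component is meaningful, as this small sketch shows:

```python
from datetime import datetime

# Reported extremes of the visit-time variable, including the filled-in date.
reported_min = datetime.fromisoformat("2024-03-14 08:40:00")
reported_max = datetime.fromisoformat("2024-03-14 17:45:00")

# Keep only the time of day and measure the span between the two.
earliest = reported_min.time()
latest = reported_max.time()
span = reported_max - reported_min

print(earliest, latest, span)  # 08:40:00 17:45:00 9:05:00
```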
Histogram with fixed size bins (bins=50)

단체명
Text

Distinct: 1700
Distinct (%): 92.7%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 41
Median length: 31
Mean length: 11.237861
Min length: 2

Characters and Unicode

Total characters: 20599
Distinct characters: 474
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
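
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point (`Lo` is "Other Letter", `Zs` is "Space Separator"). This is only a sketch of the idea, not the profiler's own implementation. The standard library does not expose script or block, so only categories are shown. The string is the first sample row of this variable.

```python
import unicodedata
from collections import Counter

sample = "보령시장 등"  # first sample row of the group-name variable

# Count characters by Unicode general category.
categories = Counter(unicodedata.category(ch) for ch in sample)

# Official character names, e.g. to tell Hangul syllables from other letters.
names = [unicodedata.name(ch) for ch in sample]

print(categories)
print(names[0])
```

Here the five Hangul syllables fall under `Lo` and the single space under `Zs`.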

Unique

Unique: 1609
Unique (%): 87.8%
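
Distinct counts the different values in the column, while Unique counts the values that occur exactly once, which is why Unique (1609) is smaller than Distinct (1700). The sketch below shows the difference on a made-up list and re-derives the two reported percentages. The toy values are illustrative, not taken from the column.

```python
from collections import Counter

# Toy data: two values repeat, two occur once.
values = ["주민", "노인회", "주민", "서산시", "노인회", "당진시"]
counts = Counter(values)

n_distinct = len(counts)                              # different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

# Percentages reported for this variable, out of 1833 observations.
n = 1833
distinct_pct = round(1700 / n * 100, 1)
unique_pct = round(1609 / n * 100, 1)

print(n_distinct, n_unique, distinct_pct, unique_pct)
```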

Sample

1st row: 보령시장 등
2nd row: 태안고남면 주민
3rd row: 충북교육청 총무과
4th row: 당진 합덕주민
5th row: 충남 서부지역 장로

Value | Count | Frequency (%)
주민 | 262 | 5.4%
노인회 | 197 | 4.1%
서산시 | 92 | 1.9%
당진시 | 63 | 1.3%
보령시 | 56 | 1.2%
태안군 | 53 | 1.1%
논산시 | 53 | 1.1%
홍성군 | 53 | 1.1%
예산군 | 51 | 1.1%
천안시 | 48 | 1.0%
Other values (2100) | 3893 | 80.8%

Most occurring characters

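Value | Count | Frequency (%)
(space) | 2992 | 14.5%
(not preserved) | 643 | 3.1%
(not preserved) | 639 | 3.1%
(not preserved) | 587 | 2.8%
(not preserved) | 561 | 2.7%
(not preserved) | 559 | 2.7%
(not preserved) | 500 | 2.4%
(not preserved) | 481 | 2.3%
(not preserved) | 406 | 2.0%
(not preserved) | 349 | 1.7%
Other values (464) | 12882 | 62.5%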

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16957 | 82.3%
Space Separator | 2992 | 14.5%
Decimal Number | 377 | 1.8%
Other Punctuation | 77 | 0.4%
Uppercase Letter | 59 | 0.3%
Close Punctuation | 52 | 0.3%
Open Punctuation | 52 | 0.3%
Dash Punctuation | 22 | 0.1%
Lowercase Letter | 10 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 643 | 3.8%
(not preserved) | 639 | 3.8%
(not preserved) | 587 | 3.5%
(not preserved) | 561 | 3.3%
(not preserved) | 559 | 3.3%
(not preserved) | 500 | 2.9%
(not preserved) | 481 | 2.8%
(not preserved) | 406 | 2.4%
(not preserved) | 349 | 2.1%
(not preserved) | 348 | 2.1%
Other values (415) | 11884 | 70.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 11 | 18.6%
E | 5 | 8.5%
T | 4 | 6.8%
P | 4 | 6.8%
C | 4 | 6.8%
G | 4 | 6.8%
S | 4 | 6.8%
O | 3 | 5.1%
K | 2 | 3.4%
R | 2 | 3.4%
Other values (9) | 16 | 27.1%

Decimal Number
Value | Count | Frequency (%)
1 | 101 | 26.8%
2 | 78 | 20.7%
4 | 69 | 18.3%
3 | 56 | 14.9%
6 | 29 | 7.7%
5 | 27 | 7.2%
7 | 7 | 1.9%
0 | 4 | 1.1%
9 | 3 | 0.8%
8 | 3 | 0.8%

Lowercase Letter
Value | Count | Frequency (%)
a | 2 | 20.0%
s | 2 | 20.0%
n | 1 | 10.0%
p | 1 | 10.0%
c | 1 | 10.0%
t | 1 | 10.0%
m | 1 | 10.0%
w | 1 | 10.0%

Other Punctuation
Value | Count | Frequency (%)
, | 44 | 57.1%
. | 20 | 26.0%
/ | 5 | 6.5%
& | 3 | 3.9%
' | 2 | 2.6%
@ | 2 | 2.6%
· | 1 | 1.3%

Space Separator
Value | Count | Frequency (%)
(space) | 2992 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 52 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 52 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 22 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16957 | 82.3%
Common | 3573 | 17.3%
Latin | 69 | 0.3%

Most frequent character per script

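Hangul
Value | Count | Frequency (%)
(not preserved) | 643 | 3.8%
(not preserved) | 639 | 3.8%
(not preserved) | 587 | 3.5%
(not preserved) | 561 | 3.3%
(not preserved) | 559 | 3.3%
(not preserved) | 500 | 2.9%
(not preserved) | 481 | 2.8%
(not preserved) | 406 | 2.4%
(not preserved) | 349 | 2.1%
(not preserved) | 348 | 2.1%
Other values (415) | 11884 | 70.1%

Latin
Value | Count | Frequency (%)
A | 11 | 15.9%
E | 5 | 7.2%
T | 4 | 5.8%
P | 4 | 5.8%
C | 4 | 5.8%
G | 4 | 5.8%
S | 4 | 5.8%
O | 3 | 4.3%
a | 2 | 2.9%
K | 2 | 2.9%
Other values (17) | 26 | 37.7%

Common
Value | Count | Frequency (%)
(space) | 2992 | 83.7%
1 | 101 | 2.8%
2 | 78 | 2.2%
4 | 69 | 1.9%
3 | 56 | 1.6%
) | 52 | 1.5%
( | 52 | 1.5%
, | 44 | 1.2%
6 | 29 | 0.8%
5 | 27 | 0.8%
Other values (12) | 73 | 2.0%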

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16957 | 82.3%
ASCII | 3641 | 17.7%
None | 1 | < 0.1%

Most frequent character per block

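ASCII
Value | Count | Frequency (%)
(space) | 2992 | 82.2%
1 | 101 | 2.8%
2 | 78 | 2.1%
4 | 69 | 1.9%
3 | 56 | 1.5%
) | 52 | 1.4%
( | 52 | 1.4%
, | 44 | 1.2%
6 | 29 | 0.8%
5 | 27 | 0.7%
Other values (38) | 141 | 3.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 643 | 3.8%
(not preserved) | 639 | 3.8%
(not preserved) | 587 | 3.5%
(not preserved) | 561 | 3.3%
(not preserved) | 559 | 3.3%
(not preserved) | 500 | 2.9%
(not preserved) | 481 | 2.8%
(not preserved) | 406 | 2.4%
(not preserved) | 349 | 2.1%
(not preserved) | 348 | 2.1%
Other values (415) | 11884 | 70.1%

None
Value | Count | Frequency (%)
· | 1 | 100.0%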

방문목적
Text

Distinct: 86
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 37
Median length: 4
Mean length: 4.4189853
Min length: 2

Characters and Unicode

Total characters: 8100
Distinct characters: 181
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 70
Unique (%): 3.8%

Sample

1st row: 청사견학
2nd row: 청사견학
3rd row: 청사견학
4th row: 청사견학
5th row: 청사견학

Value | Count | Frequency (%)
청사견학 | 1298 | 57.2%
견학 | 351 | 15.5%
청사 | 218 | 9.6%
시설견학 | 77 | 3.4%
관광 | 57 | 2.5%
도청 | 11 | 0.5%
(not preserved) | 11 | 0.5%
시설 | 10 | 0.4%
투어 | 7 | 0.3%
방청 | 7 | 0.3%
Other values (161) | 221 | 9.7%

Most occurring characters

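Value | Count | Frequency (%)
(not preserved) | 1772 | 21.9%
(not preserved) | 1751 | 21.6%
(not preserved) | 1570 | 19.4%
(not preserved) | 1546 | 19.1%
(space) | 435 | 5.4%
(not preserved) | 105 | 1.3%
(not preserved) | 102 | 1.3%
(not preserved) | 75 | 0.9%
(not preserved) | 60 | 0.7%
(not preserved) | 41 | 0.5%
Other values (171) | 643 | 7.9%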

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7588 | 93.7%
Space Separator | 435 | 5.4%
Decimal Number | 26 | 0.3%
Other Punctuation | 20 | 0.2%
Open Punctuation | 12 | 0.1%
Close Punctuation | 11 | 0.1%
Uppercase Letter | 4 | < 0.1%
Dash Punctuation | 3 | < 0.1%
Connector Punctuation | 1 | < 0.1%

Most frequent character per category

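Other Letter
Value | Count | Frequency (%)
(not preserved) | 1772 | 23.4%
(not preserved) | 1751 | 23.1%
(not preserved) | 1570 | 20.7%
(not preserved) | 1546 | 20.4%
(not preserved) | 105 | 1.4%
(not preserved) | 102 | 1.3%
(not preserved) | 75 | 1.0%
(not preserved) | 60 | 0.8%
(not preserved) | 41 | 0.5%
(not preserved) | 22 | 0.3%
Other values (152) | 544 | 7.2%

Decimal Number
Value | Count | Frequency (%)
4 | 7 | 26.9%
0 | 6 | 23.1%
1 | 5 | 19.2%
5 | 3 | 11.5%
3 | 3 | 11.5%
2 | 1 | 3.8%
6 | 1 | 3.8%

Other Punctuation
Value | Count | Frequency (%)
, | 14 | 70.0%
: | 4 | 20.0%
& | 1 | 5.0%
/ | 1 | 5.0%

Uppercase Letter
Value | Count | Frequency (%)
C | 2 | 50.0%
T | 1 | 25.0%
V | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 435 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 12 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 11 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%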

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7588 | 93.7%
Common | 508 | 6.3%
Latin | 4 | < 0.1%

Most frequent character per script

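Hangul
Value | Count | Frequency (%)
(not preserved) | 1772 | 23.4%
(not preserved) | 1751 | 23.1%
(not preserved) | 1570 | 20.7%
(not preserved) | 1546 | 20.4%
(not preserved) | 105 | 1.4%
(not preserved) | 102 | 1.3%
(not preserved) | 75 | 1.0%
(not preserved) | 60 | 0.8%
(not preserved) | 41 | 0.5%
(not preserved) | 22 | 0.3%
Other values (152) | 544 | 7.2%

Common
Value | Count | Frequency (%)
(space) | 435 | 85.6%
, | 14 | 2.8%
( | 12 | 2.4%
) | 11 | 2.2%
4 | 7 | 1.4%
0 | 6 | 1.2%
1 | 5 | 1.0%
: | 4 | 0.8%
5 | 3 | 0.6%
- | 3 | 0.6%
Other values (6) | 8 | 1.6%

Latin
Value | Count | Frequency (%)
C | 2 | 50.0%
T | 1 | 25.0%
V | 1 | 25.0%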

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7588 | 93.7%
ASCII | 512 | 6.3%

Most frequent character per block

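Hangul
Value | Count | Frequency (%)
(not preserved) | 1772 | 23.4%
(not preserved) | 1751 | 23.1%
(not preserved) | 1570 | 20.7%
(not preserved) | 1546 | 20.4%
(not preserved) | 105 | 1.4%
(not preserved) | 102 | 1.3%
(not preserved) | 75 | 1.0%
(not preserved) | 60 | 0.8%
(not preserved) | 41 | 0.5%
(not preserved) | 22 | 0.3%
Other values (152) | 544 | 7.2%

ASCII
Value | Count | Frequency (%)
(space) | 435 | 85.0%
, | 14 | 2.7%
( | 12 | 2.3%
) | 11 | 2.1%
4 | 7 | 1.4%
0 | 6 | 1.2%
1 | 5 | 1.0%
: | 4 | 0.8%
5 | 3 | 0.6%
- | 3 | 0.6%
Other values (9) | 12 | 2.3%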

견학인원
Real number (ℝ)

Distinct: 101
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32.780687
Minimum: 1
Maximum: 800
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 4
Q1: 15
Median: 30
Q3: 40
95-th percentile: 80
Maximum: 800
Range: 799
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 31.226929
Coefficient of variation (CV): 0.95260141
Kurtosis: 201.69267
Mean: 32.780687
Median Absolute Deviation (MAD): 10
Skewness: 9.3384213
Sum: 60087
Variance: 975.12109
Monotonicity: Not monotonic
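
The large skewness and kurtosis point to a long right tail: the maximum of 800 sits far above the median of 30. One common way to quantify this is Tukey's 1.5 x IQR rule, sketched below on the reported quartiles and the ten largest values listed for this variable. The rule and its 1.5 multiplier are an assumption brought in for illustration; the report itself does not flag outliers this way.

```python
# Reported quartiles for the visitor-count variable.
q1, q3 = 15, 40
iqr = q3 - q1

# Tukey fences: values outside [lower_fence, upper_fence] are treated as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest values listed in the report.
largest_values = [800, 200, 196, 186, 180, 172, 160, 150, 140, 138]
flagged = [v for v in largest_values if v > upper_fence]

print(lower_fence, upper_fence, len(flagged))
```

With an upper fence of 77.5, all ten of the largest values are flagged, and so is the 95-th percentile value of 80.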
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
40 | 215 | 11.7%
30 | 161 | 8.8%
20 | 130 | 7.1%
35 | 70 | 3.8%
10 | 64 | 3.5%
15 | 63 | 3.4%
25 | 61 | 3.3%
5 | 58 | 3.2%
50 | 51 | 2.8%
80 | 43 | 2.3%
Other values (91) | 917 | 50.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 5 | 0.3%
2 | 24 | 1.3%
3 | 37 | 2.0%
4 | 36 | 2.0%
5 | 58 | 3.2%
6 | 30 | 1.6%
7 | 19 | 1.0%
8 | 35 | 1.9%
9 | 11 | 0.6%
10 | 64 | 3.5%

Maximum 10 values
Value | Count | Frequency (%)
800 | 1 | 0.1%
200 | 3 | 0.2%
196 | 1 | 0.1%
186 | 1 | 0.1%
180 | 2 | 0.1%
172 | 1 | 0.1%
160 | 2 | 0.1%
150 | 4 | 0.2%
140 | 1 | 0.1%
138 | 1 | 0.1%

활동지역
Text

Distinct: 107
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 14
Median length: 3
Mean length: 2.9907256
Min length: 2

Characters and Unicode

Total characters: 5482
Distinct characters: 103
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
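
The mean length reported for each text variable equals its total character count divided by the 1833 observations. The sketch below re-derives it for the three text columns named in the sample table (단체명, 방문목적, 활동지역), using only figures from this report. The dictionary layout is illustrative.

```python
import math

n = 1833  # number of observations

# column name -> (total characters, reported mean length)
reported = {
    "단체명": (20599, 11.237861),
    "방문목적": (8100, 4.4189853),
    "활동지역": (5482, 2.9907256),
}

# Mean length implied by the total character count.
mean_lengths = {name: total / n for name, (total, _) in reported.items()}

for name, (_, stated) in reported.items():
    ok = math.isclose(mean_lengths[name], stated, rel_tol=1e-6)
    print(name, round(mean_lengths[name], 6), "consistent" if ok else "MISMATCH")
```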

Unique

Unique: 54
Unique (%): 2.9%

Sample

1st row: 보령
2nd row: 태안
3rd row: 충청북도
4th row: 당진
5th row: 충청남도

Value | Count | Frequency (%)
홍성군 | 179 | 9.5%
충청남도 | 146 | 7.8%
천안시 | 117 | 6.2%
서산시 | 89 | 4.7%
아산시 | 81 | 4.3%
예산군 | 76 | 4.0%
당진시 | 71 | 3.8%
태안군 | 68 | 3.6%
논산시 | 57 | 3.0%
보령 | 56 | 3.0%
Other values (95) | 940 | 50.0%

Most occurring characters

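Value | Count | Frequency (%)
(not preserved) | 639 | 11.7%
(not preserved) | 502 | 9.2%
(not preserved) | 473 | 8.6%
(not preserved) | 255 | 4.7%
(not preserved) | 236 | 4.3%
(not preserved) | 233 | 4.3%
(not preserved) | 216 | 3.9%
(not preserved) | 214 | 3.9%
(not preserved) | 202 | 3.7%
(not preserved) | 176 | 3.2%
Other values (93) | 2336 | 42.6%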

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5430 | 99.1%
Space Separator | 47 | 0.9%
Other Punctuation | 3 | 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

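Other Letter
Value | Count | Frequency (%)
(not preserved) | 639 | 11.8%
(not preserved) | 502 | 9.2%
(not preserved) | 473 | 8.7%
(not preserved) | 255 | 4.7%
(not preserved) | 236 | 4.3%
(not preserved) | 233 | 4.3%
(not preserved) | 216 | 4.0%
(not preserved) | 214 | 3.9%
(not preserved) | 202 | 3.7%
(not preserved) | 176 | 3.2%
Other values (88) | 2284 | 42.1%

Other Punctuation
Value | Count | Frequency (%)
, | 2 | 66.7%
. | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 47 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%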

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5430 | 99.1%
Common | 52 | 0.9%

Most frequent character per script

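Hangul
Value | Count | Frequency (%)
(not preserved) | 639 | 11.8%
(not preserved) | 502 | 9.2%
(not preserved) | 473 | 8.7%
(not preserved) | 255 | 4.7%
(not preserved) | 236 | 4.3%
(not preserved) | 233 | 4.3%
(not preserved) | 216 | 4.0%
(not preserved) | 214 | 3.9%
(not preserved) | 202 | 3.7%
(not preserved) | 176 | 3.2%
Other values (88) | 2284 | 42.1%

Common
Value | Count | Frequency (%)
(space) | 47 | 90.4%
, | 2 | 3.8%
( | 1 | 1.9%
) | 1 | 1.9%
. | 1 | 1.9%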

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5430 | 99.1%
ASCII | 52 | 0.9%

Most frequent character per block

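Hangul
Value | Count | Frequency (%)
(not preserved) | 639 | 11.8%
(not preserved) | 502 | 9.2%
(not preserved) | 473 | 8.7%
(not preserved) | 255 | 4.7%
(not preserved) | 236 | 4.3%
(not preserved) | 233 | 4.3%
(not preserved) | 216 | 4.0%
(not preserved) | 214 | 3.9%
(not preserved) | 202 | 3.7%
(not preserved) | 176 | 3.2%
Other values (88) | 2284 | 42.1%

ASCII
Value | Count | Frequency (%)
(space) | 47 | 90.4%
, | 2 | 3.8%
( | 1 | 1.9%
) | 1 | 1.9%
. | 1 | 1.9%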

Interactions

[Figure: four interaction plots]

Correlations

Variable | 일련번호 | 방문시간 | 방문목적 | 견학인원
일련번호 | 1.000 | 0.332 | 0.641 | 0.050
방문시간 | 0.332 | 1.000 | 0.000 | 0.000
방문목적 | 0.641 | 0.000 | 1.000 | 0.203
견학인원 | 0.050 | 0.000 | 0.203 | 1.000

Variable | 일련번호 | 견학인원
일련번호 | 1.000 | -0.025
견학인원 | -0.025 | 1.000
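
The report does not state which coefficient each matrix uses. To illustrate how a pairwise value such as the one between 일련번호 and 견학인원 can be obtained, the sketch below computes a Pearson correlation by hand on the ten first sample rows only. Both the choice of Pearson and the ten-row subset are assumptions, so the result is not expected to reproduce the reported figure.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# Serial number and visitor count from the ten first sample rows.
serial = [58, 59, 61, 60, 62, 63, 64, 65, 66, 68]
visitors = [40, 40, 3, 60, 150, 18, 5, 15, 27, 34]

r = pearson(serial, visitors)
print(round(r, 3))
```

Running the same function over all 1833 rows of the two numeric columns would give the full-data Pearson value.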

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

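First rows
Index | 일련번호 | 방문날짜 | 방문시간 | 단체명 | 방문목적 | 견학인원 | 활동지역
0 | 58 | 2013-01-08 | 10:00 | 보령시장 등 | 청사견학 | 40 | 보령
1 | 59 | 2013-01-09 | 13:00 | 태안고남면 주민 | 청사견학 | 40 | 태안
2 | 61 | 2013-01-11 | 14:00 | 충북교육청 총무과 | 청사견학 | 3 | 충청북도
3 | 60 | 2013-01-11 | 10:00 | 당진 합덕주민 | 청사견학 | 60 | 당진
4 | 62 | 2013-01-12 | 15:00 | 충남 서부지역 장로 | 청사견학 | 150 | 충청남도
5 | 63 | 2013-01-14 | 15:00 | 완도지역 중고 교장단 | 청사견학 | 18 | 충청남도
6 | 64 | 2013-01-15 | 14:00 | 대전시장 등 | 청사견학 | 5 | 대전광역시
7 | 65 | 2013-01-15 | 12:30 | 보건지소 협의회장 | 청사견학 | 15 | 충청남도
8 | 66 | 2013-01-15 | 14:30 | 부석초 교장 등 | 청사견학 | 27 | 충청남도
9 | 68 | 2013-01-16 | 14:00 | 천안시 의장 등 | 청사견학 | 34 | 천안

Last rows
Index | 일련번호 | 방문날짜 | 방문시간 | 단체명 | 방문목적 | 견학인원 | 활동지역
1823 | 3204 | 2023-07-12 | 14:30 | 삽교고등학교 | 청사견학 | 42 | 예산군
1824 | 3205 | 2023-08-28 | 10:00 | 금산군 추부면 노인회분회 | 청사견학 | 25 | 금산군
1825 | 3206 | 2023-10-24 | 10:00 | 원광대학교 창의공과대학 도시공학부 | 청사견학 | 73 | 익산시
1826 | 3207 | 2023-10-24 | 14:00 | 홍성고등학교 | 청사견학 | 2 | 홍성군
1827 | 3208 | 2023-10-26 | 10:30 | 거산초, 송남초등학교 | 청사견학 | 29 | 아산시
1828 | 3209 | 2023-11-16 | 10:20 | 논산 노인회 | 청사견학 | 15 | 논산시
1829 | 3210 | 2023-11-27 | 15:30 | 청년도전 지원사업 참여자 | 청사견학 | 70 | 홍성군
1830 | 3211 | 2023-12-11 | 10:00 | 월전초등학교 | 청사견학 | 38 | 보령시
1831 | 3212 | 2023-12-14 | 10:00 | 홍성 장곡초등학교 | 청사견학 | 19 | 홍성군
1832 | 3213 | 2023-12-18 | 15:30 | 청년도전 지원사업 참여자 | 청사견학 | 47 | 홍성군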