Overview

Dataset statistics

Number of variables10
Number of observations246
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.8 KiB
Average record size in memory82.5 B

Variable types

Numeric1
DateTime3
Categorical2
Text3
Boolean1

Dataset

Description한국한의학연구원 대표홈페이지를 통해 견학신청을 완료한 통계 데이터 입니다. 데이터의 항목은 신청자번호,신청일,신청시간구분,신청인원,신청자분류,신청단체명,신청자메모,승인현황,등록자,등록일,수정,수정일로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15085970/fileData.do

Alerts

승인현황 is highly imbalanced (59.3%)Imbalance
신청번호 has unique valuesUnique
등록자 has unique valuesUnique
수정자 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:01:44.653369
Analysis finished2023-12-11 23:01:45.356694
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

신청번호
Real number (ℝ)

UNIQUE 

Distinct246
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean215.04065
Minimum20
Maximum581
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T08:01:45.423574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile36.5
Q1100.5
median207.5
Q3317.75
95-th percentile395.75
Maximum581
Range561
Interquartile range (IQR)217.25

Descriptive statistics

Standard deviation127.69507
Coefficient of variation (CV)0.59381829
Kurtosis-0.29904111
Mean215.04065
Median Absolute Deviation (MAD)109
Skewness0.42845913
Sum52900
Variance16306.031
MonotonicityStrictly increasing
2023-12-12T08:01:45.572156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 1
 
0.4%
272 1
 
0.4%
275 1
 
0.4%
277 1
 
0.4%
278 1
 
0.4%
279 1
 
0.4%
283 1
 
0.4%
284 1
 
0.4%
286 1
 
0.4%
288 1
 
0.4%
Other values (236) 236
95.9%
ValueCountFrequency (%)
20 1
0.4%
21 1
0.4%
23 1
0.4%
25 1
0.4%
26 1
0.4%
27 1
0.4%
28 1
0.4%
29 1
0.4%
30 1
0.4%
31 1
0.4%
ValueCountFrequency (%)
581 1
0.4%
580 1
0.4%
576 1
0.4%
574 1
0.4%
570 1
0.4%
568 1
0.4%
415 1
0.4%
411 1
0.4%
410 1
0.4%
404 1
0.4%
Distinct217
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2015-08-31 00:00:00
Maximum2023-04-27 00:00:00
2023-12-12T08:01:45.756755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:01:45.904302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청시간
Categorical

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
1
131 
2
115 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 131
53.3%
2 115
46.7%

Length

2023-12-12T08:01:46.018223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:01:46.113148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 131
53.3%
2 115
46.7%

신청자분류
Categorical

Distinct8
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
초등학생
66 
중학생
63 
고등학생
51 
일반인
26 
대학생
21 
Other values (3)
19 

Length

Max length6
Median length4
Mean length3.5284553
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중학생
2nd row일반인
3rd row관련 종사자
4th row고등학생
5th row중학생

Common Values

ValueCountFrequency (%)
초등학생 66
26.8%
중학생 63
25.6%
고등학생 51
20.7%
일반인 26
 
10.6%
대학생 21
 
8.5%
기타 11
 
4.5%
초등학교이하 5
 
2.0%
관련 종사자 3
 
1.2%

Length

2023-12-12T08:01:46.219210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:01:46.341740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초등학생 66
26.5%
중학생 63
25.3%
고등학생 51
20.5%
일반인 26
 
10.4%
대학생 21
 
8.4%
기타 11
 
4.4%
초등학교이하 5
 
2.0%
관련 3
 
1.2%
종사자 3
 
1.2%
Distinct144
Distinct (%)58.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T08:01:46.661192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.8089431
Min length2

Characters and Unicode

Total characters1429
Distinct characters151
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)44.3%

Sample

1st row대전***원
2nd row대전***회
3rd row대외***팀
4th row김해***교
5th row대전***교
ValueCountFrequency (%)
대전***교 16
 
6.5%
대전***원 13
 
5.3%
서울***교 12
 
4.9%
개인*인 10
 
4.1%
대전***회 6
 
2.4%
둔산***교 5
 
2.0%
울산***감 5
 
2.0%
삼천***교 5
 
2.0%
한국***원 5
 
2.0%
충남***과 4
 
1.6%
Other values (134) 165
67.1%
2023-12-12T08:01:47.161578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 692
48.4%
86
 
6.0%
70
 
4.9%
52
 
3.6%
31
 
2.2%
26
 
1.8%
18
 
1.3%
17
 
1.2%
15
 
1.0%
14
 
1.0%
Other values (141) 408
28.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 704
49.3%
Other Punctuation 695
48.6%
Uppercase Letter 15
 
1.0%
Close Punctuation 5
 
0.3%
Lowercase Letter 5
 
0.3%
Decimal Number 3
 
0.2%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
12.2%
70
 
9.9%
52
 
7.4%
31
 
4.4%
26
 
3.7%
18
 
2.6%
17
 
2.4%
15
 
2.1%
14
 
2.0%
13
 
1.8%
Other values (121) 362
51.4%
Uppercase Letter
ValueCountFrequency (%)
K 4
26.7%
A 3
20.0%
O 2
13.3%
I 1
 
6.7%
P 1
 
6.7%
M 1
 
6.7%
L 1
 
6.7%
G 1
 
6.7%
S 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
* 692
99.6%
. 2
 
0.3%
, 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
e 2
40.0%
d 2
40.0%
y 1
20.0%
Decimal Number
ValueCountFrequency (%)
0 1
33.3%
2 1
33.3%
1 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 705
49.3%
Hangul 704
49.3%
Latin 20
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
12.2%
70
 
9.9%
52
 
7.4%
31
 
4.4%
26
 
3.7%
18
 
2.6%
17
 
2.4%
15
 
2.1%
14
 
2.0%
13
 
1.8%
Other values (121) 362
51.4%
Latin
ValueCountFrequency (%)
K 4
20.0%
A 3
15.0%
e 2
10.0%
d 2
10.0%
O 2
10.0%
I 1
 
5.0%
P 1
 
5.0%
M 1
 
5.0%
L 1
 
5.0%
G 1
 
5.0%
Other values (2) 2
10.0%
Common
ValueCountFrequency (%)
* 692
98.2%
) 5
 
0.7%
. 2
 
0.3%
( 2
 
0.3%
, 1
 
0.1%
0 1
 
0.1%
2 1
 
0.1%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 725
50.7%
Hangul 704
49.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 692
95.4%
) 5
 
0.7%
K 4
 
0.6%
A 3
 
0.4%
e 2
 
0.3%
d 2
 
0.3%
. 2
 
0.3%
O 2
 
0.3%
( 2
 
0.3%
I 1
 
0.1%
Other values (10) 10
 
1.4%
Hangul
ValueCountFrequency (%)
86
 
12.2%
70
 
9.9%
52
 
7.4%
31
 
4.4%
26
 
3.7%
18
 
2.6%
17
 
2.4%
15
 
2.1%
14
 
2.0%
13
 
1.8%
Other values (121) 362
51.4%

승인현황
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size378.0 B
True
226 
False
 
20
ValueCountFrequency (%)
True 226
91.9%
False 20
 
8.1%
2023-12-12T08:01:47.273325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

등록자
Text

UNIQUE 

Distinct246
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T08:01:47.579508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.7520325
Min length5

Characters and Unicode

Total characters1415
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)100.0%

Sample

1st row홍길동20
2nd row홍길동21
3rd row홍길동23
4th row홍길동25
5th row홍길동26
ValueCountFrequency (%)
홍길동20 1
 
0.4%
홍길동267 1
 
0.4%
홍길동272 1
 
0.4%
홍길동273 1
 
0.4%
홍길동275 1
 
0.4%
홍길동277 1
 
0.4%
홍길동278 1
 
0.4%
홍길동279 1
 
0.4%
홍길동283 1
 
0.4%
홍길동284 1
 
0.4%
Other values (236) 236
95.9%
2023-12-12T08:01:48.092402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
246
17.4%
246
17.4%
246
17.4%
3 110
7.8%
1 107
7.6%
2 96
 
6.8%
5 60
 
4.2%
0 55
 
3.9%
7 55
 
3.9%
4 54
 
3.8%
Other values (3) 140
9.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 738
52.2%
Decimal Number 677
47.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Other Letter
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 738
52.2%
Common 677
47.8%

Most frequent character per script

Common
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Hangul
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 738
52.2%
ASCII 677
47.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%
ASCII
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Distinct139
Distinct (%)56.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2015-08-21 00:00:00
Maximum2023-03-10 00:00:00
2023-12-12T08:01:48.237097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:01:48.367031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수정자
Text

UNIQUE 

Distinct246
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T08:01:48.731030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.7520325
Min length5

Characters and Unicode

Total characters1415
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)100.0%

Sample

1st row홍길동20
2nd row홍길동21
3rd row홍길동23
4th row홍길동25
5th row홍길동26
ValueCountFrequency (%)
홍길동20 1
 
0.4%
홍길동267 1
 
0.4%
홍길동272 1
 
0.4%
홍길동273 1
 
0.4%
홍길동275 1
 
0.4%
홍길동277 1
 
0.4%
홍길동278 1
 
0.4%
홍길동279 1
 
0.4%
홍길동283 1
 
0.4%
홍길동284 1
 
0.4%
Other values (236) 236
95.9%
2023-12-12T08:01:49.268161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
246
17.4%
246
17.4%
246
17.4%
3 110
7.8%
1 107
7.6%
2 96
 
6.8%
5 60
 
4.2%
0 55
 
3.9%
7 55
 
3.9%
4 54
 
3.8%
Other values (3) 140
9.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 738
52.2%
Decimal Number 677
47.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Other Letter
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 738
52.2%
Common 677
47.8%

Most frequent character per script

Common
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Hangul
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 738
52.2%
ASCII 677
47.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
246
33.3%
246
33.3%
246
33.3%
ASCII
ValueCountFrequency (%)
3 110
16.2%
1 107
15.8%
2 96
14.2%
5 60
8.9%
0 55
8.1%
7 55
8.1%
4 54
8.0%
8 53
7.8%
9 44
 
6.5%
6 43
 
6.4%
Distinct139
Distinct (%)56.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2015-08-21 00:00:00
Maximum2023-03-10 00:00:00
2023-12-12T08:01:49.436110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:01:49.587778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T08:01:44.959811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:01:49.666142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청번호신청시간신청자분류승인현황
신청번호1.0000.0670.3100.421
신청시간0.0671.0000.1400.203
신청자분류0.3100.1401.0000.329
승인현황0.4210.2030.3291.000
2023-12-12T08:01:49.769472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청시간승인현황신청자분류
신청시간1.0000.1300.103
승인현황0.1301.0000.243
신청자분류0.1030.2431.000
2023-12-12T08:01:49.878954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청번호신청시간신청자분류승인현황
신청번호1.0000.0650.1570.415
신청시간0.0651.0000.1030.130
신청자분류0.1570.1031.0000.243
승인현황0.4150.1300.2431.000

Missing values

2023-12-12T08:01:45.131280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:01:45.298177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

신청번호신청일신청시간신청자분류신청단체명승인현황등록자등록일수정자수정일
0202015-09-091중학생대전***원Y홍길동202015-08-21홍길동202015-08-21
1212015-09-101일반인대전***회Y홍길동212015-08-21홍길동212015-08-21
2232015-08-311관련 종사자대외***팀Y홍길동232015-08-21홍길동232015-08-21
3252015-10-141고등학생김해***교Y홍길동252015-09-01홍길동252015-09-01
4262015-10-061중학생대전***교Y홍길동262015-09-01홍길동262015-09-01
5272015-10-221중학생삼천***교Y홍길동272015-09-01홍길동272015-09-01
6282015-10-281고등학생김해***고Y홍길동282015-09-02홍길동282015-09-02
7292015-10-222고등학생대전***교Y홍길동292015-09-02홍길동292015-09-02
8302015-10-022초등학생대전***리Y홍길동302015-09-08홍길동302015-09-08
9312015-10-081중학생김천***교Y홍길동312015-09-08홍길동312015-09-08
신청번호신청일신청시간신청자분류신청단체명승인현황등록자등록일수정자수정일
2364042019-09-262초등학생대전***교Y홍길동4042019-09-17홍길동4042019-09-17
2374102020-01-231초등학교이하과학***집N홍길동4102019-12-09홍길동4102019-12-09
2384112020-04-231대학생충남***과N홍길동4112020-03-04홍길동4112020-03-04
2394152020-06-251초등학생유성***청N홍길동4152020-05-25홍길동4152020-05-25
2405682022-07-282초등학생대전***어Y홍길동5682022-07-12홍길동5682022-07-12
2415702022-08-252초등학생대전***어Y홍길동5702022-07-12홍길동5702022-07-12
2425742022-10-271중학생광양***eY홍길동5742022-09-13홍길동5742022-09-13
2435762022-10-272대학생충남***과Y홍길동5762022-09-13홍길동5762022-09-13
2445802023-04-271기타예정*정Y홍길동5802023-03-10홍길동5802023-03-10
2455812023-04-272기타예정*정Y홍길동5812023-03-10홍길동5812023-03-10