Overview

Dataset statistics

Number of variables: 3
Number of observations: 366
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.1 KiB
Average record size in memory: 25.4 B

Variable types

Text: 1
Categorical: 1
Numeric: 1

Dataset

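Description: Daily inpatient counts at the National Center for Mental Health (국립정신건강센터). The data lists the date (일자), day of the week (요일), and number of inpatients (입원환자수).
Author: Ministry of Health and Welfare, National Center for Mental Health (보건복지부 국립정신건강센터)
URL: https://www.data.go.kr/data/15060198/fileData.do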

Alerts

입원환자수 is highly overall correlated with 요일 (High correlation)
요일 is highly overall correlated with 입원환자수 (High correlation)
일자 has unique values (Unique)

Reproduction

Analysis started: 2023-12-13 00:47:01.982555
Analysis finished: 2023-12-13 00:47:02.264314
Duration: 0.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일자
Text

UNIQUE 

Distinct: 366
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.9781421
Min length: 2
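
The mean length sits just below 10, which is what one would expect if 365 values are ten-character dates and a single value has the minimum length of 2. The sketch below checks that arithmetic. The 365/1 split is an assumption inferred from the figures; the report does not state it.

```python
# Assumption (not stated in the report): 365 values are ISO dates of
# length 10 and one value has length 2 (the reported minimum length).
n_dates, date_len = 365, 10
n_short, short_len = 1, 2

total_chars = n_dates * date_len + n_short * short_len
mean_len = total_chars / (n_dates + n_short)

print(total_chars)         # compare with "Total characters" for this variable
print(round(mean_len, 7))  # compare with "Mean length"
```

If the assumption holds, both printed values should agree with the figures the report gives for 일자.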

Characters and Unicode

Total characters: 3652
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
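
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on two values that appear in the Sample section of this report. It is not the profiler's own implementation, only a minimal way to reproduce the category names used in the tables that follow.

```python
import unicodedata
from collections import Counter

# Two illustrative values taken from the Sample section of this report.
values = ["2020-01-01", "총계"]

# General category of every character:
#   Nd = Decimal Number, Pd = Dash Punctuation, Lo = Other Letter
categories = Counter(
    unicodedata.category(ch) for value in values for ch in value
)

for code, count in categories.items():
    print(code, count)
```

Digits fall under Decimal Number, the hyphen under Dash Punctuation, and Hangul syllables under Other Letter, which matches the three categories listed for this variable.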

Unique

Unique: 366
Unique (%): 100.0%

Sample

1st row: 2020-01-01
2nd row: 2020-01-02
3rd row: 2020-01-03
4th row: 2020-01-04
5th row: 2020-01-05
Value               Count  Frequency (%)
2020-01-01          1      0.3%
2020-09-08          1      0.3%
2020-09-06          1      0.3%
2020-09-05          1      0.3%
2020-09-04          1      0.3%
2020-09-03          1      0.3%
2020-09-02          1      0.3%
2020-09-01          1      0.3%
2020-08-31          1      0.3%
2020-08-30          1      0.3%
Other values (356)  356    97.3%

Most occurring characters

Value             Count  Frequency (%)
0                 1177   32.2%
2                 945    25.9%
-                 730    20.0%
1                 316    8.7%
3                 85     2.3%
5                 67     1.8%
7                 67     1.8%
8                 67     1.8%
4                 66     1.8%
9                 66     1.8%
Other values (3)  66     1.8%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    2920   80.0%
Dash Punctuation  730    20.0%
Other Letter      2      0.1%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      1177   40.3%
2      945    32.4%
1      316    10.8%
3      85     2.9%
5      67     2.3%
7      67     2.3%
8      67     2.3%
4      66     2.3%
9      66     2.3%
6      64     2.2%
Other Letter
Value  Count  Frequency (%)
총     1      50.0%
계     1      50.0%
Dash Punctuation
Value  Count  Frequency (%)
-      730    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  3650   99.9%
Hangul  2      0.1%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      1177   32.2%
2      945    25.9%
-      730    20.0%
1      316    8.7%
3      85     2.3%
5      67     1.8%
7      67     1.8%
8      67     1.8%
4      66     1.8%
9      66     1.8%
Hangul
Value  Count  Frequency (%)
총     1      50.0%
계     1      50.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   3650   99.9%
Hangul  2      0.1%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      1177   32.2%
2      945    25.9%
-      730    20.0%
1      316    8.7%
3      85     2.3%
5      67     1.8%
7      67     1.8%
8      67     1.8%
4      66     1.8%
9      66     1.8%
Hangul
Value  Count  Frequency (%)
총     1      50.0%
계     1      50.0%

요일
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB
Value             Count
                  53
                  53
                  52
                  52
                  52
Other values (3)  104

Length

Max length: 4
Median length: 1
Mean length: 1.0081967
Min length: 1

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value  Count  Frequency (%)
       53     14.5%
       53     14.5%
       52     14.2%
       52     14.2%
       52     14.2%
       52     14.2%
       51     13.9%
<NA>   1      0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
       53     14.5%
       53     14.5%
       52     14.2%
       52     14.2%
       52     14.2%
       52     14.2%
       51     13.9%
na     1      0.3%

입원환자수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 92
Distinct (%): 25.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83.15847
Minimum: 1
Maximum: 15218
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 21
Median: 38
Q3: 59
95-th percentile: 93
Maximum: 15218
Range: 15217
Interquartile range (IQR): 38
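
The range and IQR follow directly from the quantiles above. The sketch below recomputes them and, as an added illustration, compares the maximum against the common 1.5 × IQR upper fence (Tukey's rule of thumb). That outlier rule is an assumption introduced here; the report itself does not apply it.

```python
# Quantile figures copied from this report.
q1, q3 = 21, 59
minimum, maximum = 1, 15218

iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum

# Tukey's rule of thumb (an assumption, not part of the report):
# values above Q3 + 1.5 * IQR are commonly flagged as outliers.
upper_fence = q3 + 1.5 * iqr
max_is_outlier = maximum > upper_fence

print(iqr, value_range, upper_fence, max_is_outlier)
```

Under this rule the maximum of 15218 lies far beyond the fence, while the 95th percentile of 93 does not.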

Descriptive statistics

Standard deviation: 793.74011
Coefficient of variation (CV): 9.54491
Kurtosis: 365.14188
Mean: 83.15847
Median Absolute Deviation (MAD): 19
Skewness: 19.097585
Sum: 30436
Variance: 630023.37
Monotonicity: Not monotonic
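
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A minimal consistency check using only the rounded values printed in this report (so small rounding differences are expected):

```python
# Figures copied from this report (already rounded by the profiler).
n_obs = 366
total = 30436
mean = 83.15847
std = 793.74011

mean_from_sum = total / n_obs   # should be close to the reported mean
variance = std ** 2             # should be close to the reported variance
cv = std / mean                 # should be close to the reported CV

print(round(mean_from_sum, 5), round(variance, 2), round(cv, 5))
```
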
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
9                  13     3.6%
28                 12     3.3%
3                  12     3.3%
38                 9      2.5%
34                 9      2.5%
39                 9      2.5%
55                 9      2.5%
37                 9      2.5%
4                  8      2.2%
43                 8      2.2%
Other values (82)  268    73.2%

Minimum 10 values
Value  Count  Frequency (%)
1      5      1.4%
2      3      0.8%
3      12     3.3%
4      8      2.2%
5      4      1.1%
6      4      1.1%
7      3      0.8%
8      4      1.1%
9      13     3.6%
11     5      1.4%

Maximum 10 values
Value  Count  Frequency (%)
15218  1      0.3%
109    1      0.3%
103    1      0.3%
98     1      0.3%
97     3      0.8%
96     4      1.1%
95     3      0.8%
94     3      0.8%
93     4      1.1%
92     2      0.5%

Interactions


Correlations

            요일    입원환자수
요일        1.000   NaN
입원환자수  NaN     1.000
            입원환자수  요일
입원환자수  1.000       1.000
요일        1.000       1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows
     일자        요일  입원환자수
0    2020-01-01        81
1    2020-01-02        84
2    2020-01-03        87
3    2020-01-04        83
4    2020-01-05        84
5    2020-01-06        89
6    2020-01-07        84
7    2020-01-08        85
8    2020-01-09        84
9    2020-01-10        85

Last rows
     일자        요일  입원환자수
356  2020-12-23        96
357  2020-12-24        103
358  2020-12-25        95
359  2020-12-26        92
360  2020-12-27        97
361  2020-12-28        91
362  2020-12-29        77
363  2020-12-30        73
364  2020-12-31        58
365  총계        <NA>  15218
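
The final row, labelled 총계 ("total") with a missing 요일 and a value of 15218, looks like a totals row rather than a daily observation: 15218 is exactly half of the reported Sum of 30436. The sketch below checks that relationship and estimates the daily mean with that row excluded. Treating the row as a total is an inference from the figures, not something the report states.

```python
# Figures copied from this report.
report_sum = 30436   # "Sum" of 입원환자수 over all 366 rows
total_row = 15218    # value in the row labelled 총계
n_rows = 366

# Assumption: the 총계 row is a precomputed total of the daily rows.
daily_sum = report_sum - total_row
row_is_total = daily_sum == total_row

# Mean over the remaining 365 daily rows.
mean_without_total = daily_sum / (n_rows - 1)

print(row_is_total, round(mean_without_total, 2))
```

If the inference is right, the daily mean is close to the reported median of 38, and the mean, maximum, standard deviation, skewness, and kurtosis reported for 입원환자수 are all dominated by this single row.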