Overview

Dataset statistics

Number of variables5
Number of observations402
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.0 KiB
Average record size in memory43.3 B

Variable types

DateTime1
Categorical1
Numeric3

Dataset

Description경기도_공공의료사업(무료이동진료사업(외국인근로자))실적 현황
Author경기도의료원
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=6C8E2M90FDK8E7V06RS628872118&infSeq=1

Alerts

진료실적(실인원) is highly overall correlated with 진료실적(연인원) and 1 other fieldsHigh correlation
진료실적(연인원) is highly overall correlated with 진료실적(실인원) and 1 other fieldsHigh correlation
지원금액 is highly overall correlated with 진료실적(실인원) and 1 other fieldsHigh correlation
진료실적(실인원) has 208 (51.7%) zerosZeros
진료실적(연인원) has 208 (51.7%) zerosZeros
지원금액 has 208 (51.7%) zerosZeros

Reproduction

Analysis started2023-12-10 22:23:13.608258
Analysis finished2023-12-10 22:23:14.621600
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Date

Distinct67
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2018-01-01 00:00:00
Maximum2023-07-01 00:00:00
2023-12-11T07:23:14.681369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:14.803573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

병원명
Categorical

Distinct6
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
안성
67 
포천
67 
수원
67 
의정부
67 
파주
67 

Length

Max length3
Median length2
Mean length2.1666667
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안성
2nd row포천
3rd row수원
4th row의정부
5th row파주

Common Values

ValueCountFrequency (%)
안성 67
16.7%
포천 67
16.7%
수원 67
16.7%
의정부 67
16.7%
파주 67
16.7%
이천 67
16.7%

Length

2023-12-11T07:23:14.957151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:23:15.062418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안성 67
16.7%
포천 67
16.7%
수원 67
16.7%
의정부 67
16.7%
파주 67
16.7%
이천 67
16.7%

진료실적(실인원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct32
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.0024876
Minimum0
Maximum37
Zeros208
Zeros (%)51.7%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-11T07:23:15.173555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q35
95-th percentile19.95
Maximum37
Range37
Interquartile range (IQR)5

Descriptive statistics

Standard deviation6.8188161
Coefficient of variation (CV)1.7036445
Kurtosis4.1267534
Mean4.0024876
Median Absolute Deviation (MAD)0
Skewness2.0729879
Sum1609
Variance46.496253
MonotonicityNot monotonic
2023-12-11T07:23:15.278061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
0 208
51.7%
1 39
 
9.7%
2 23
 
5.7%
3 18
 
4.5%
4 12
 
3.0%
9 9
 
2.2%
8 9
 
2.2%
16 7
 
1.7%
13 7
 
1.7%
17 6
 
1.5%
Other values (22) 64
 
15.9%
ValueCountFrequency (%)
0 208
51.7%
1 39
 
9.7%
2 23
 
5.7%
3 18
 
4.5%
4 12
 
3.0%
5 5
 
1.2%
6 5
 
1.2%
7 5
 
1.2%
8 9
 
2.2%
9 9
 
2.2%
ValueCountFrequency (%)
37 1
 
0.2%
33 2
0.5%
30 1
 
0.2%
28 1
 
0.2%
27 1
 
0.2%
26 1
 
0.2%
25 1
 
0.2%
24 3
0.7%
23 1
 
0.2%
22 1
 
0.2%

진료실적(연인원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct75
Distinct (%)18.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.039801
Minimum0
Maximum170
Zeros208
Zeros (%)51.7%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-11T07:23:15.608161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q320
95-th percentile71.9
Maximum170
Range170
Interquartile range (IQR)20

Descriptive statistics

Standard deviation25.841226
Coefficient of variation (CV)1.7181894
Kurtosis6.5655741
Mean15.039801
Median Absolute Deviation (MAD)0
Skewness2.3614885
Sum6046
Variance667.76899
MonotonicityNot monotonic
2023-12-11T07:23:15.743131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 208
51.7%
10 10
 
2.5%
1 9
 
2.2%
8 8
 
2.0%
4 6
 
1.5%
20 6
 
1.5%
5 6
 
1.5%
3 6
 
1.5%
7 5
 
1.2%
28 5
 
1.2%
Other values (65) 133
33.1%
ValueCountFrequency (%)
0 208
51.7%
1 9
 
2.2%
2 4
 
1.0%
3 6
 
1.5%
4 6
 
1.5%
5 6
 
1.5%
6 5
 
1.2%
7 5
 
1.2%
8 8
 
2.0%
9 5
 
1.2%
ValueCountFrequency (%)
170 1
0.2%
144 1
0.2%
122 1
0.2%
118 1
0.2%
104 1
0.2%
102 1
0.2%
99 1
0.2%
95 2
0.5%
89 2
0.5%
88 1
0.2%

지원금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct195
Distinct (%)48.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2655763.8
Minimum0
Maximum52987980
Zeros208
Zeros (%)51.7%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-11T07:23:15.878290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33524417.5
95-th percentile11902833
Maximum52987980
Range52987980
Interquartile range (IQR)3524417.5

Descriptive statistics

Standard deviation5145993.2
Coefficient of variation (CV)1.9376698
Kurtosis25.573423
Mean2655763.8
Median Absolute Deviation (MAD)0
Skewness3.8951828
Sum1.067617 × 109
Variance2.6481246 × 1013
MonotonicityNot monotonic
2023-12-11T07:23:16.016726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 208
51.7%
4715950 1
 
0.2%
405800 1
 
0.2%
13288667 1
 
0.2%
5073260 1
 
0.2%
44660 1
 
0.2%
1717370 1
 
0.2%
131845 1
 
0.2%
519650 1
 
0.2%
8631900 1
 
0.2%
Other values (185) 185
46.0%
ValueCountFrequency (%)
0 208
51.7%
12200 1
 
0.2%
13610 1
 
0.2%
17170 1
 
0.2%
44660 1
 
0.2%
47008 1
 
0.2%
49790 1
 
0.2%
61640 1
 
0.2%
69700 1
 
0.2%
93670 1
 
0.2%
ValueCountFrequency (%)
52987980 1
0.2%
27020480 1
0.2%
24583090 1
0.2%
23808380 1
0.2%
20239490 1
0.2%
19946660 1
0.2%
19878160 1
0.2%
19365940 1
0.2%
18839130 1
0.2%
16474680 1
0.2%

Interactions

2023-12-11T07:23:14.257034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:13.762955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:13.997551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:14.336442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:13.840940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:14.084139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:14.419229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:13.921920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:23:14.175886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:23:16.107780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월병원명진료실적(실인원)진료실적(연인원)지원금액
년월1.0000.0000.0000.1740.165
병원명0.0001.0000.5510.4420.313
진료실적(실인원)0.0000.5511.0000.8270.600
진료실적(연인원)0.1740.4420.8271.0000.834
지원금액0.1650.3130.6000.8341.000
2023-12-11T07:23:16.212374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료실적(실인원)진료실적(연인원)지원금액병원명
진료실적(실인원)1.0000.9720.9540.328
진료실적(연인원)0.9721.0000.9840.262
지원금액0.9540.9841.0000.192
병원명0.3280.2620.1921.000

Missing values

2023-12-11T07:23:14.509908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:23:14.587857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년월병원명진료실적(실인원)진료실적(연인원)지원금액
02019-11안성2394715950
12019-11포천17648398450
22019-12수원3202261210
32019-12의정부1101264820
42019-12파주16407297770
52019-12이천5375924360
62019-12안성000
72019-12포천16455881550
82020-01수원000
92020-01의정부000
년월병원명진료실적(실인원)진료실적(연인원)지원금액
3922019-10수원000
3932019-10의정부1163541020
3942019-10파주9506523600
3952019-10이천281621510
3962019-10안성132719946660
3972019-10포천16568670820
3982019-11수원000
3992019-11의정부000
4002019-11파주10325143630
4012019-11이천3121848230