gimi9 Pandas Profiling

Dataset statistics

Number of variables	3
Number of observations	34
Missing cells	0
Missing cells (%)	0.0%
Duplicate rows	0
Duplicate rows (%)	0.0%
Total size in memory	1016.0 B
Average record size in memory	29.9 B
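
The average record size follows from the two figures above: total size in memory divided by the number of observations. A minimal check, with illustrative variable names that are not part of the report:

```python
# Figures copied from the "Dataset statistics" table above.
total_size_bytes = 1016.0
n_observations = 34

# Average record size = total size in memory / number of rows.
avg_record_size = total_size_bytes / n_observations
print(f"{avg_record_size:.1f} B")
```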

Variable types

Categorical	1
Text	1
Numeric	1

Dataset

Description	Medical aid recipients in Nam-gu, Busan Metropolitan City (부산광역시남구의료급여대상자현황_20200516)
Author	Nam-gu, Busan Metropolitan City (부산광역시 남구)
URL	http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3081435

Alerts

인원 has unique values

Reproduction

Analysis started	2023-12-10 17:04:58.404395
Analysis finished	2023-12-10 17:04:59.162683
Duration	0.76 seconds
Software version	ydata-profiling v4.5.1
Download configuration	config.json

종별 (medical aid class)
Categorical

Distinct	2
Distinct (%)	5.9%
Missing	0
Missing (%)	0.0%
Memory size	404.0 B

1	17
2	17

Length

Max length	1
Median length	1
Mean length	1
Min length	1

Unique

Unique	0
Unique (%)	0.0%

Sample

1st row	1
2nd row	1
3rd row	1
4th row	1
5th row	1

Common Values

Value	Count	Frequency (%)
1	17	50.0%
2	17	50.0%

Length

Histogram of lengths of the category

구분 (administrative district)
Text

Distinct	17
Distinct (%)	50.0%
Missing	0
Missing (%)	0.0%
Memory size	404.0 B

Length

Max length	4
Median length	4
Mean length	3.8823529
Min length	3

Characters and Unicode

Total characters	132
Distinct characters	18
Distinct categories	2
Distinct scripts	2
Distinct blocks	2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
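
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to tally general categories. It assumes the 17 district names visible in the report's sample rows each occur twice, as the frequency table suggests. The standard library exposes general categories only, so scripts and blocks are not covered here.

```python
import unicodedata
from collections import Counter

# The 17 district names that appear in the report's sample rows.
names = [
    "대연1동", "대연3동", "대연4동", "대연5동", "대연6동",
    "용호1동", "용호2동", "용호3동", "용호4동", "용당동",
    "감만1동", "감만2동", "문현1동", "문현2동", "문현3동",
    "문현4동", "우암동",
]
# Assumption: every name occurs exactly twice (34 rows, 17 distinct values).
values = names * 2

text = "".join(values)
# General category per character: 'Lo' = Other Letter, 'Nd' = Decimal Number.
category_counts = Counter(unicodedata.category(ch) for ch in text)

print("Total characters:", len(text))
print("Distinct characters:", len(set(text)))
print("Categories:", dict(category_counts))
```

Under that assumption the tallies agree with the totals reported in this section (132 characters, 18 distinct, 102 letters and 30 digits).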

Unique

Unique	0
Unique (%)	0.0%

Sample

1st row	대연1동
2nd row	대연3동
3rd row	대연4동
4th row	대연5동
5th row	대연6동

Common Values

Value	Count	Frequency (%)
대연1동	2	5.9%
용당동	2	5.9%
문현4동	2	5.9%
문현3동	2	5.9%
문현2동	2	5.9%
문현1동	2	5.9%
감만2동	2	5.9%
감만1동	2	5.9%
용호4동	2	5.9%
대연3동	2	5.9%
Other values (7)	14	41.2%
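
A table of this shape can be rebuilt from raw values with a counter: keep the ten most frequent values and fold the remainder into an "Other values" row. The sketch below is one way to do it, not the profiler's own implementation. It assumes each of the 17 district names occurs twice, and because all counts tie, the order of the top ten rows may differ from the report.

```python
from collections import Counter

names = [
    "대연1동", "대연3동", "대연4동", "대연5동", "대연6동",
    "용호1동", "용호2동", "용호3동", "용호4동", "용당동",
    "감만1동", "감만2동", "문현1동", "문현2동", "문현3동",
    "문현4동", "우암동",
]
values = names * 2  # assumption: each district appears once per 종별 value

counts = Counter(values)
n = len(values)

# Top 10 values; ties are returned in first-seen order.
top = counts.most_common(10)
other_count = n - sum(count for _, count in top)
other_distinct = len(counts) - len(top)

print("Value\tCount\tFrequency (%)")
for value, count in top:
    print(f"{value}\t{count}\t{count / n:.1%}")
print(f"Other values ({other_distinct})\t{other_count}\t{other_count / n:.1%}")
```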

Most occurring characters

Value	Count	Frequency (%)
동	34	25.8%
대	10	7.6%
용	10	7.6%
연	10	7.6%
현	8	6.1%
문	8	6.1%
호	8	6.1%
1	8	6.1%
4	6	4.5%
2	6	4.5%
Other values (8)	24	18.2%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	102	77.3%
Decimal Number	30	22.7%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
동	34	33.3%
대	10	9.8%
용	10	9.8%
연	10	9.8%
현	8	7.8%
문	8	7.8%
호	8	7.8%
감	4	3.9%
만	4	3.9%
당	2	2.0%
Other values (2)	4	3.9%

Decimal Number

Value	Count	Frequency (%)
1	8	26.7%
4	6	20.0%
2	6	20.0%
3	6	20.0%
6	2	6.7%
5	2	6.7%

Most occurring scripts

Value	Count	Frequency (%)
Hangul	102	77.3%
Common	30	22.7%

Most frequent character per script

Hangul

Value	Count	Frequency (%)
동	34	33.3%
대	10	9.8%
용	10	9.8%
연	10	9.8%
현	8	7.8%
문	8	7.8%
호	8	7.8%
감	4	3.9%
만	4	3.9%
당	2	2.0%
Other values (2)	4	3.9%

Common

Value	Count	Frequency (%)
1	8	26.7%
4	6	20.0%
2	6	20.0%
3	6	20.0%
6	2	6.7%
5	2	6.7%

Most occurring blocks

Value	Count	Frequency (%)
Hangul	102	77.3%
ASCII	30	22.7%

Most frequent character per block

Hangul

Value	Count	Frequency (%)
동	34	33.3%
대	10	9.8%
용	10	9.8%
연	10	9.8%
현	8	7.8%
문	8	7.8%
호	8	7.8%
감	4	3.9%
만	4	3.9%
당	2	2.0%
Other values (2)	4	3.9%

ASCII

Value	Count	Frequency (%)
1	8	26.7%
4	6	20.0%
2	6	20.0%
3	6	20.0%
6	2	6.7%
5	2	6.7%

인원 (number of recipients)
Real number (ℝ)

UNIQUE

Distinct	34
Distinct (%)	100.0%
Missing	0
Missing (%)	0.0%
Infinite	0
Infinite (%)	0.0%
Mean	407.20588

Minimum	185
Maximum	903
Zeros	0
Zeros (%)	0.0%
Negative	0
Negative (%)	0.0%
Memory size	438.0 B

Quantile statistics

Minimum	185
5-th percentile	224.4
Q1	291.25
median	367.5
Q3	488.25
95-th percentile	719.45
Maximum	903
Range	718
Interquartile range (IQR)	197

Descriptive statistics

Standard deviation	162.28609
Coefficient of variation (CV)	0.39853572
Kurtosis	1.4954215
Mean	407.20588
Median Absolute Deviation (MAD)	98
Skewness	1.2134381
Sum	13845
Variance	26336.775
Monotonicity	Not monotonic
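
Several of the figures above are derived from one another, which gives a quick consistency check. The sketch below recomputes the mean, range, IQR, coefficient of variation, and variance from the other reported numbers. The inputs are the rounded values printed in this report, so small rounding differences are expected.

```python
# Reported figures for 인원, copied from the tables above.
n = 34
total = 13845
minimum, maximum = 185, 903
q1, q3 = 291.25, 488.25
std = 162.28609

mean = total / n                 # Mean = Sum / number of observations
value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # IQR = Q3 - Q1
cv = std / mean                  # CV = standard deviation / mean
variance = std ** 2              # Variance = standard deviation squared

print(f"mean={mean:.5f} range={value_range} iqr={iqr}")
print(f"cv={cv:.6f} variance={variance:.3f}")
```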

Histogram with fixed size bins (bins=34)

Common Values

Value	Count	Frequency (%)
404	1	2.9%
251	1	2.9%
402	1	2.9%
230	1	2.9%
903	1	2.9%
346	1	2.9%
587	1	2.9%
318	1	2.9%
710	1	2.9%
332	1	2.9%
Other values (24)	24	70.6%

Minimum 10 values

Value	Count	Frequency (%)
185	1	2.9%
214	1	2.9%
230	1	2.9%
251	1	2.9%
264	1	2.9%
268	1	2.9%
271	1	2.9%
275	1	2.9%
291	1	2.9%
292	1	2.9%

Maximum 10 values

Value	Count	Frequency (%)
903	1	2.9%
737	1	2.9%
710	1	2.9%
587	1	2.9%
581	1	2.9%
574	1	2.9%
533	1	2.9%
517	1	2.9%
493	1	2.9%
474	1	2.9%

Interactions

Scatter plot of 인원 against 인원

Correlations

Phik (φk)

	종별	구분	인원
종별	1.000	0.000	0.484
구분	0.000	1.000	0.690
인원	0.484	0.690	1.000

Auto

	인원	종별
인원	1.000	0.298
종별	0.298	1.000

Missing values

Count: A simple visualization of nullity by column.

Matrix: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

	종별	구분	인원
0	1	대연1동	404
1	1	대연3동	275
2	1	대연4동	325
3	1	대연5동	292
4	1	대연6동	185
5	1	용호1동	737
6	1	용호2동	291
7	1	용호3동	493
8	1	용호4동	268
9	1	용당동	214

Last rows

	종별	구분	인원
24	2	용호3동	587
25	2	용호4동	318
26	2	용당동	251
27	2	감만1동	710
28	2	감만2동	315
29	2	문현1동	581
30	2	문현2동	533
31	2	문현3동	467
32	2	문현4동	357
33	2	우암동	378
