Overview

Dataset statistics

Number of variables4
Number of observations23
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory914.0 B
Average record size in memory39.7 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description해당 데이터는 인천광역시 남동구의 남동구보건소 검사항목 및 수수료에 관련된 자료로서, 인천광역시 남동구 남동구보건소 검사항목 및 수수료의 연번, 검사항목, 비용(원), 비고의 정보를 확인할 수 있다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15104004&srcSe=7661IVAWM27C61E190

Alerts

비고 has constant value ""Constant
연번 has unique valuesUnique
검사항목 has unique valuesUnique

Reproduction

Analysis started2024-03-13 06:15:38.297546
Analysis finished2024-03-13 06:15:38.861679
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12
Minimum1
Maximum23
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2024-03-13T15:15:38.945546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.1
Q16.5
median12
Q317.5
95-th percentile21.9
Maximum23
Range22
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.78233
Coefficient of variation (CV)0.56519417
Kurtosis-1.2
Mean12
Median Absolute Deviation (MAD)6
Skewness0
Sum276
Variance46
MonotonicityStrictly increasing
2024-03-13T15:15:39.046209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1 1
 
4.3%
2 1
 
4.3%
23 1
 
4.3%
22 1
 
4.3%
21 1
 
4.3%
20 1
 
4.3%
19 1
 
4.3%
18 1
 
4.3%
17 1
 
4.3%
16 1
 
4.3%
Other values (13) 13
56.5%
ValueCountFrequency (%)
1 1
4.3%
2 1
4.3%
3 1
4.3%
4 1
4.3%
5 1
4.3%
6 1
4.3%
7 1
4.3%
8 1
4.3%
9 1
4.3%
10 1
4.3%
ValueCountFrequency (%)
23 1
4.3%
22 1
4.3%
21 1
4.3%
20 1
4.3%
19 1
4.3%
18 1
4.3%
17 1
4.3%
16 1
4.3%
15 1
4.3%
14 1
4.3%

검사항목
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2024-03-13T15:15:39.218574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length5.7826087
Min length2

Characters and Unicode

Total characters133
Distinct characters69
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st rowA형 간염
2nd rowB형 간염
3rd rowC형 간염
4th rowHBe - 항원검사
5th row빈혈
ValueCountFrequency (%)
간염 3
 
10.7%
a형 1
 
3.6%
전립선암표지자 1
 
3.6%
난소암표지자 1
 
3.6%
대장암표지자 1
 
3.6%
간암표지자 1
 
3.6%
통풍 1
 
3.6%
류마티스 1
 
3.6%
혈당(채혈 1
 
3.6%
혈당(손끝 1
 
3.6%
Other values (16) 16
57.1%
2024-03-13T15:15:39.504796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 8
 
6.0%
( 8
 
6.0%
6
 
4.5%
5
 
3.8%
5
 
3.8%
5
 
3.8%
4
 
3.0%
4
 
3.0%
4
 
3.0%
4
 
3.0%
Other values (59) 80
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97
72.9%
Close Punctuation 8
 
6.0%
Open Punctuation 8
 
6.0%
Uppercase Letter 8
 
6.0%
Space Separator 5
 
3.8%
Decimal Number 5
 
3.8%
Lowercase Letter 1
 
0.8%
Dash Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
Other values (43) 53
54.6%
Uppercase Letter
ValueCountFrequency (%)
R 2
25.0%
B 2
25.0%
A 1
12.5%
C 1
12.5%
H 1
12.5%
P 1
12.5%
Decimal Number
ValueCountFrequency (%)
1 1
20.0%
0 1
20.0%
3 1
20.0%
8 1
20.0%
4 1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 97
72.9%
Common 27
 
20.3%
Latin 9
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
Other values (43) 53
54.6%
Common
ValueCountFrequency (%)
) 8
29.6%
( 8
29.6%
5
18.5%
- 1
 
3.7%
1 1
 
3.7%
0 1
 
3.7%
3 1
 
3.7%
8 1
 
3.7%
4 1
 
3.7%
Latin
ValueCountFrequency (%)
R 2
22.2%
B 2
22.2%
A 1
11.1%
C 1
11.1%
H 1
11.1%
e 1
11.1%
P 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 97
72.9%
ASCII 36
 
27.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 8
22.2%
( 8
22.2%
5
13.9%
R 2
 
5.6%
B 2
 
5.6%
A 1
 
2.8%
C 1
 
2.8%
H 1
 
2.8%
e 1
 
2.8%
- 1
 
2.8%
Other values (6) 6
16.7%
Hangul
ValueCountFrequency (%)
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
Other values (43) 53
54.6%

비용(원)
Real number (ℝ)

Distinct22
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5913.0435
Minimum400
Maximum21600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2024-03-13T15:15:39.618085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum400
5-th percentile610
Q1950
median4000
Q37800
95-th percentile16260
Maximum21600
Range21200
Interquartile range (IQR)6850

Descriptive statistics

Standard deviation5768.118
Coefficient of variation (CV)0.97549055
Kurtosis1.2498454
Mean5913.0435
Median Absolute Deviation (MAD)3100
Skewness1.3039905
Sum136000
Variance33271186
MonotonicityNot monotonic
2024-03-13T15:15:39.725278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
900 2
 
8.7%
15000 1
 
4.3%
2500 1
 
4.3%
3300 1
 
4.3%
6400 1
 
4.3%
6900 1
 
4.3%
5800 1
 
4.3%
4000 1
 
4.3%
1000 1
 
4.3%
800 1
 
4.3%
Other values (12) 12
52.2%
ValueCountFrequency (%)
400 1
4.3%
600 1
4.3%
700 1
4.3%
800 1
4.3%
900 2
8.7%
1000 1
4.3%
2400 1
4.3%
2500 1
4.3%
2900 1
4.3%
3300 1
4.3%
ValueCountFrequency (%)
21600 1
4.3%
16400 1
4.3%
15000 1
4.3%
12800 1
4.3%
8500 1
4.3%
8300 1
4.3%
7300 1
4.3%
6900 1
4.3%
6600 1
4.3%
6400 1
4.3%

비고
Categorical

CONSTANT 

Distinct1
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size316.0 B
의사 미채용으로 일시 중단
23 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의사 미채용으로 일시 중단
2nd row의사 미채용으로 일시 중단
3rd row의사 미채용으로 일시 중단
4th row의사 미채용으로 일시 중단
5th row의사 미채용으로 일시 중단

Common Values

ValueCountFrequency (%)
의사 미채용으로 일시 중단 23
100.0%

Length

2024-03-13T15:15:39.832068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T15:15:39.914787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의사 23
25.0%
미채용으로 23
25.0%
일시 23
25.0%
중단 23
25.0%

Interactions

2024-03-13T15:15:38.582511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T15:15:38.425928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T15:15:38.653909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T15:15:38.502508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T15:15:39.968943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번검사항목비용(원)
연번1.0001.0000.000
검사항목1.0001.0001.000
비용(원)0.0001.0001.000
2024-03-13T15:15:40.066687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번비용(원)
연번1.000-0.191
비용(원)-0.1911.000

Missing values

2024-03-13T15:15:38.747780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T15:15:38.824735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번검사항목비용(원)비고
01A형 간염15000의사 미채용으로 일시 중단
12B형 간염2900의사 미채용으로 일시 중단
23C형 간염7300의사 미채용으로 일시 중단
34HBe - 항원검사8300의사 미채용으로 일시 중단
45빈혈900의사 미채용으로 일시 중단
56소변검사(10종)900의사 미채용으로 일시 중단
67신장기능검사(3종)2400의사 미채용으로 일시 중단
78간기능(8종)6600의사 미채용으로 일시 중단
89고지혈증(4종)8500의사 미채용으로 일시 중단
910갑상선12800의사 미채용으로 일시 중단
연번검사항목비용(원)비고
1314혈액형2500의사 미채용으로 일시 중단
1415혈당(손끝)400의사 미채용으로 일시 중단
1516혈당(채혈)600의사 미채용으로 일시 중단
1617류마티스800의사 미채용으로 일시 중단
1718통풍1000의사 미채용으로 일시 중단
1819간암표지자4000의사 미채용으로 일시 중단
1920대장암표지자5800의사 미채용으로 일시 중단
2021난소암표지자6900의사 미채용으로 일시 중단
2122전립선암표지자6400의사 미채용으로 일시 중단
2223흉부직촬(폐결핵)3300의사 미채용으로 일시 중단