Overview

Dataset statistics

Number of variables3
Number of observations160
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory25.8 B

Variable types

Categorical1
Text1
Numeric1

Dataset

Description울산광역시 남구 일자리정책분야 추진사업에 대한 데이터로 부처(일자리정책과, 노인장애인과 등) , 사업명, 예산이 제공됩니다.
Author울산광역시 남구
URLhttps://www.data.go.kr/data/15112034/fileData.do

Alerts

사업명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:28:43.611712
Analysis finished2023-12-12 18:28:44.133910
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

부처
Categorical

Distinct29
Distinct (%)18.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
일자리정책과
20 
공원녹지과
15 
여성가족과
13 
건강행복과
12 
건설과
12 
Other values (24)
88 

Length

Max length7
Median length5
Mean length4.8875
Min length3

Unique

Unique6 ?
Unique (%)3.8%

Sample

1st row기획예산실
2nd row기획예산실
3rd row기획예산실
4th row총무과
5th row총무과

Common Values

ValueCountFrequency (%)
일자리정책과 20
12.5%
공원녹지과 15
 
9.4%
여성가족과 13
 
8.1%
건강행복과 12
 
7.5%
건설과 12
 
7.5%
도시창조과 10
 
6.2%
노인장애인과 9
 
5.6%
경제정책과 8
 
5.0%
보건관리과 8
 
5.0%
환경관리과 6
 
3.8%
Other values (19) 47
29.4%

Length

2023-12-13T03:28:44.237952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일자리정책과 20
12.5%
공원녹지과 15
 
9.4%
여성가족과 13
 
8.1%
건강행복과 12
 
7.5%
건설과 12
 
7.5%
도시창조과 10
 
6.2%
노인장애인과 9
 
5.6%
경제정책과 8
 
5.0%
보건관리과 8
 
5.0%
환경관리과 6
 
3.8%
Other values (19) 47
29.4%

사업명
Text

UNIQUE 

Distinct160
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T03:28:44.602374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length11.66875
Min length4

Characters and Unicode

Total characters1867
Distinct characters306
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique160 ?
Unique (%)100.0%

Sample

1st row사업체조사
2nd row울산광역시 사회조사
3rd row광업제조업조사
4th row청사 안내 자원봉사 도우미
5th row대체인력
ValueCountFrequency (%)
운영 15
 
4.2%
사업 8
 
2.2%
7
 
2.0%
유지관리 4
 
1.1%
관리 4
 
1.1%
지원 4
 
1.1%
장생포 3
 
0.8%
보건소 3
 
0.8%
조사 3
 
0.8%
하수도 3
 
0.8%
Other values (283) 302
84.8%
2023-12-13T03:28:45.134904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
10.7%
83
 
4.4%
75
 
4.0%
53
 
2.8%
49
 
2.6%
39
 
2.1%
32
 
1.7%
31
 
1.7%
30
 
1.6%
26
 
1.4%
Other values (296) 1250
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1627
87.1%
Space Separator 199
 
10.7%
Decimal Number 12
 
0.6%
Close Punctuation 9
 
0.5%
Open Punctuation 9
 
0.5%
Other Punctuation 4
 
0.2%
Lowercase Letter 4
 
0.2%
Uppercase Letter 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
83
 
5.1%
75
 
4.6%
53
 
3.3%
49
 
3.0%
39
 
2.4%
32
 
2.0%
31
 
1.9%
30
 
1.8%
26
 
1.6%
24
 
1.5%
Other values (279) 1185
72.8%
Decimal Number
ValueCountFrequency (%)
2 5
41.7%
1 4
33.3%
0 2
 
16.7%
9 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
p 1
25.0%
u 1
25.0%
i 1
25.0%
n 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
. 1
25.0%
· 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
R 1
50.0%
V 1
50.0%
Space Separator
ValueCountFrequency (%)
199
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1627
87.1%
Common 234
 
12.5%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
83
 
5.1%
75
 
4.6%
53
 
3.3%
49
 
3.0%
39
 
2.4%
32
 
2.0%
31
 
1.9%
30
 
1.8%
26
 
1.6%
24
 
1.5%
Other values (279) 1185
72.8%
Common
ValueCountFrequency (%)
199
85.0%
) 9
 
3.8%
( 9
 
3.8%
2 5
 
2.1%
1 4
 
1.7%
, 2
 
0.9%
0 2
 
0.9%
- 1
 
0.4%
9 1
 
0.4%
. 1
 
0.4%
Latin
ValueCountFrequency (%)
R 1
16.7%
V 1
16.7%
p 1
16.7%
u 1
16.7%
i 1
16.7%
n 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1627
87.1%
ASCII 239
 
12.8%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
199
83.3%
) 9
 
3.8%
( 9
 
3.8%
2 5
 
2.1%
1 4
 
1.7%
, 2
 
0.8%
0 2
 
0.8%
- 1
 
0.4%
9 1
 
0.4%
. 1
 
0.4%
Other values (6) 6
 
2.5%
Hangul
ValueCountFrequency (%)
83
 
5.1%
75
 
4.6%
53
 
3.3%
49
 
3.0%
39
 
2.4%
32
 
2.0%
31
 
1.9%
30
 
1.8%
26
 
1.6%
24
 
1.5%
Other values (279) 1185
72.8%
None
ValueCountFrequency (%)
· 1
100.0%

예산
Real number (ℝ)

Distinct116
Distinct (%)72.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean302.97125
Minimum3
Maximum3493
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T03:28:45.315747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile9.8
Q133
median88
Q3301.5
95-th percentile1277.45
Maximum3493
Range3490
Interquartile range (IQR)268.5

Descriptive statistics

Standard deviation541.79289
Coefficient of variation (CV)1.788265
Kurtosis14.897441
Mean302.97125
Median Absolute Deviation (MAD)71
Skewness3.5391118
Sum48475.4
Variance293539.54
MonotonicityNot monotonic
2023-12-13T03:28:45.858728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
44.0 8
 
5.0%
22.0 7
 
4.4%
300.0 4
 
2.5%
33.0 4
 
2.5%
16.0 4
 
2.5%
32.0 3
 
1.9%
5.0 3
 
1.9%
500.0 3
 
1.9%
21.0 3
 
1.9%
6.0 2
 
1.2%
Other values (106) 119
74.4%
ValueCountFrequency (%)
3.0 1
 
0.6%
4.0 2
1.2%
5.0 3
1.9%
6.0 2
1.2%
10.0 2
1.2%
12.0 2
1.2%
13.0 2
1.2%
15.0 1
 
0.6%
16.0 4
2.5%
18.0 1
 
0.6%
ValueCountFrequency (%)
3493.0 1
0.6%
3212.0 1
0.6%
2808.0 1
0.6%
1909.0 1
0.6%
1837.0 1
0.6%
1574.0 1
0.6%
1511.0 1
0.6%
1400.0 1
0.6%
1271.0 1
0.6%
1250.0 1
0.6%

Interactions

2023-12-13T03:28:43.844752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:28:45.977876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부처예산
부처1.0000.597
예산0.5971.000
2023-12-13T03:28:46.075104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예산부처
예산1.0000.262
부처0.2621.000

Missing values

2023-12-13T03:28:44.004507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:28:44.093627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

부처사업명예산
0기획예산실사업체조사176.0
1기획예산실울산광역시 사회조사37.0
2기획예산실광업제조업조사13.0
3총무과청사 안내 자원봉사 도우미12.0
4총무과대체인력319.0
5총무과기록물정리43.0
6총무과남구청 청소용역303.0
7주민자치과동행정복지센터 화장실 청소용역124.0
8주민자치과동행정복지센터 바닥 및 유리창 청소용역39.0
9민원여권과여권업무 사무보조21.0
부처사업명예산
150건강행복과모바일헬스케어사업44.0
151건강행복과예방접종등록센터32.0
152건강행복과구강보건사업174.0
153건강행복과국가암사업22.0
154건강행복과금연규제시설 지도점검70.0
155건강행복과지역사회중심 금연지원 서비스60.0
156건강행복과지역사회중심재활사업22.0
157건강행복과영양플러스사업22.0
158건강행복과방문건강관리사업44.0
159고래문화재단울산고래축제1250.0