Overview

Dataset statistics

Number of variables5
Number of observations49
Missing cells27
Missing cells (%)11.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory43.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description본 공공데이터는 완주군 관내에 위치한 직업소개사업소에 대한 데이터로 사업소명, 사업소 주소, 전화번호 등의 항목을 제공합니다
Author전라북도 완주군
URLhttps://www.data.go.kr/data/3076584/fileData.do

Alerts

연번 is highly overall correlated with 유무료구분High correlation
유무료구분 is highly overall correlated with 연번High correlation
유무료구분 is highly imbalanced (59.2%)Imbalance
사업소전화번호 has 27 (55.1%) missing valuesMissing
연번 has unique valuesUnique
사업소명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 20:53:53.813274
Analysis finished2024-03-14 20:53:55.046879
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size569.0 B
2024-03-15T05:53:55.335537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2024-03-15T05:53:55.814804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

유무료구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size520.0 B
유료
45 
무료
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 45
91.8%
무료 4
 
8.2%

Length

2024-03-15T05:53:56.202512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T05:53:56.370798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 45
91.8%
무료 4
 
8.2%

사업소명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
2024-03-15T05:53:57.648352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length6.1632653
Min length4

Characters and Unicode

Total characters302
Distinct characters118
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row참좋은일터
2nd row개상인력개발 봉동점
3rd row대우인력
4th row공원인력공사
5th row효성파워인력
ValueCountFrequency (%)
봉동점 2
 
3.4%
2
 
3.4%
직업소개소 2
 
3.4%
가치인력 2
 
3.4%
참좋은일터 1
 
1.7%
대한인력 1
 
1.7%
삼례삼봉인력공사 1
 
1.7%
동원종합인력 1
 
1.7%
베스트인력 1
 
1.7%
역전종합인력 1
 
1.7%
Other values (45) 45
76.3%
2024-03-15T05:53:59.162670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
12.6%
38
 
12.6%
15
 
5.0%
10
 
3.3%
10
 
3.3%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (108) 163
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 287
95.0%
Space Separator 10
 
3.3%
Lowercase Letter 3
 
1.0%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
13.2%
38
 
13.2%
15
 
5.2%
10
 
3.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (102) 153
53.3%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
b 1
33.3%
a 1
33.3%
Space Separator
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 287
95.0%
Common 12
 
4.0%
Latin 3
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
13.2%
38
 
13.2%
15
 
5.2%
10
 
3.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (102) 153
53.3%
Common
ValueCountFrequency (%)
10
83.3%
( 1
 
8.3%
) 1
 
8.3%
Latin
ValueCountFrequency (%)
c 1
33.3%
b 1
33.3%
a 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 287
95.0%
ASCII 15
 
5.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
 
13.2%
38
 
13.2%
15
 
5.2%
10
 
3.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (102) 153
53.3%
ASCII
ValueCountFrequency (%)
10
66.7%
( 1
 
6.7%
c 1
 
6.7%
b 1
 
6.7%
a 1
 
6.7%
) 1
 
6.7%
Distinct47
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size520.0 B
2024-03-15T05:54:00.313261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length26.387755
Min length21

Characters and Unicode

Total characters1293
Distinct characters94
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)91.8%

Sample

1st row전북특별자치도 완주군 용진읍 구억명덕로 131
2nd row전북특별자치도 완주군 봉동읍 봉동로 140-1, 2층
3rd row전북특별자치도 완주군 삼례읍 삼례로 407, 1층
4th row전북특별자치도 완주군 삼례읍 역참로 80-1
5th row전북특별자치도 완주군 고산면 읍내7길 52-2
ValueCountFrequency (%)
전북특별자치도 49
18.1%
완주군 48
17.8%
봉동읍 21
 
7.8%
삼례읍 16
 
5.9%
2층 8
 
3.0%
봉동로 6
 
2.2%
동학로 5
 
1.9%
봉동동서로 5
 
1.9%
1층 5
 
1.9%
삼례로 4
 
1.5%
Other values (87) 103
38.1%
2024-03-15T05:54:01.697893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
221
 
17.1%
54
 
4.2%
52
 
4.0%
51
 
3.9%
50
 
3.9%
50
 
3.9%
49
 
3.8%
49
 
3.8%
49
 
3.8%
49
 
3.8%
Other values (84) 619
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 873
67.5%
Space Separator 221
 
17.1%
Decimal Number 166
 
12.8%
Other Punctuation 15
 
1.2%
Dash Punctuation 12
 
0.9%
Open Punctuation 3
 
0.2%
Close Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
6.2%
52
 
6.0%
51
 
5.8%
50
 
5.7%
50
 
5.7%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
Other values (68) 371
42.5%
Decimal Number
ValueCountFrequency (%)
1 39
23.5%
2 35
21.1%
5 17
10.2%
0 16
9.6%
3 13
 
7.8%
4 12
 
7.2%
9 10
 
6.0%
6 9
 
5.4%
7 8
 
4.8%
8 7
 
4.2%
Other Punctuation
ValueCountFrequency (%)
12
80.0%
, 3
 
20.0%
Space Separator
ValueCountFrequency (%)
221
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 873
67.5%
Common 420
32.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
6.2%
52
 
6.0%
51
 
5.8%
50
 
5.7%
50
 
5.7%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
Other values (68) 371
42.5%
Common
ValueCountFrequency (%)
221
52.6%
1 39
 
9.3%
2 35
 
8.3%
5 17
 
4.0%
0 16
 
3.8%
3 13
 
3.1%
12
 
2.9%
4 12
 
2.9%
- 12
 
2.9%
9 10
 
2.4%
Other values (6) 33
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 873
67.5%
ASCII 408
31.6%
None 12
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
221
54.2%
1 39
 
9.6%
2 35
 
8.6%
5 17
 
4.2%
0 16
 
3.9%
3 13
 
3.2%
4 12
 
2.9%
- 12
 
2.9%
9 10
 
2.5%
6 9
 
2.2%
Other values (5) 24
 
5.9%
Hangul
ValueCountFrequency (%)
54
 
6.2%
52
 
6.0%
51
 
5.8%
50
 
5.7%
50
 
5.7%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
49
 
5.6%
Other values (68) 371
42.5%
None
ValueCountFrequency (%)
12
100.0%

사업소전화번호
Text

MISSING 

Distinct22
Distinct (%)100.0%
Missing27
Missing (%)55.1%
Memory size520.0 B
2024-03-15T05:54:02.420270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters264
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row063-228-3211
2nd row063-291-1557
3rd row063-263-2004
4th row063-221-9994
5th row063-227-9192
ValueCountFrequency (%)
063-228-3211 1
 
4.5%
063-291-1557 1
 
4.5%
063-262-1780 1
 
4.5%
063-261-4277 1
 
4.5%
063-290-1312 1
 
4.5%
063-262-1604 1
 
4.5%
063-263-0050 1
 
4.5%
063-291-1605 1
 
4.5%
063-261-1604 1
 
4.5%
063-261-8204 1
 
4.5%
Other values (12) 12
54.5%
2024-03-15T05:54:03.378649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 44
16.7%
0 41
15.5%
6 39
14.8%
2 34
12.9%
1 32
12.1%
3 29
11.0%
9 13
 
4.9%
5 9
 
3.4%
8 8
 
3.0%
4 8
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 220
83.3%
Dash Punctuation 44
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 41
18.6%
6 39
17.7%
2 34
15.5%
1 32
14.5%
3 29
13.2%
9 13
 
5.9%
5 9
 
4.1%
8 8
 
3.6%
4 8
 
3.6%
7 7
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 264
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 44
16.7%
0 41
15.5%
6 39
14.8%
2 34
12.9%
1 32
12.1%
3 29
11.0%
9 13
 
4.9%
5 9
 
3.4%
8 8
 
3.0%
4 8
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 264
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 44
16.7%
0 41
15.5%
6 39
14.8%
2 34
12.9%
1 32
12.1%
3 29
11.0%
9 13
 
4.9%
5 9
 
3.4%
8 8
 
3.0%
4 8
 
3.0%

Interactions

2024-03-15T05:53:54.228022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T05:54:03.646591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번유무료구분사업소명사업소주소사업소전화번호
연번1.0000.9721.0000.9691.000
유무료구분0.9721.0001.0001.0001.000
사업소명1.0001.0001.0001.0001.000
사업소주소0.9691.0001.0001.0001.000
사업소전화번호1.0001.0001.0001.0001.000
2024-03-15T05:54:03.905295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번유무료구분
연번1.0000.779
유무료구분0.7791.000

Missing values

2024-03-15T05:53:54.576338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T05:53:54.907238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번유무료구분사업소명사업소주소사업소전화번호
01유료참좋은일터전북특별자치도 완주군 용진읍 구억명덕로 131<NA>
12유료개상인력개발 봉동점전북특별자치도 완주군 봉동읍 봉동로 140-1, 2층063-228-3211
23유료대우인력전북특별자치도 완주군 삼례읍 삼례로 407, 1층<NA>
34유료공원인력공사전북특별자치도 완주군 삼례읍 역참로 80-1<NA>
45유료효성파워인력전북특별자치도 완주군 고산면 읍내7길 52-2<NA>
56유료주식회사 성웅전북특별자치도 완주군 봉동읍 봉동동서로 72(2동 101호)<NA>
67유료다와인력전북특별자치도 완주군 이서면 이서로 42063-291-1557
78유료행복드림 직업소개소전북특별자치도 완주군 봉동읍 완주산단9로 5, 2층<NA>
89유료녹색안전전북특별자치도 완주군 이서면 이서로 42<NA>
910유료미소인력전북특별자치도 완주군 비봉면 천호로 292<NA>
연번유무료구분사업소명사업소주소사업소전화번호
3940유료중앙인력전북특별자치도 완주군 삼례읍 역참로 12063-291-6400
4041유료현대건축 인력공사전북특별자치도 완주군 봉동읍 낙평동서로 54063-261-8204
4142유료완주인력사무소전북특별자치도 완주군 봉동읍 봉동로 125063-261-1604
4243유료삼강인력전북특별자치도 완주군 삼례읍 삼례로 360063-291-1605
4344유료봉동인력공사전북특별자치도 완주군 봉동읍 봉동동서로 153063-263-0050
4445유료선진인력전북특별자치도 완주군 봉동읍 봉동로 175063-262-1604
4546무료우석대학교산학협력단전북특별자치도 완주군 봉동읍 둔산3로 94, 완주군근로자종합복지관 2층063-290-1312
4647무료완주시니어클럽전북특별자치도 완주군 봉동읍 삼봉로 959063-261-4277
4748무료완주여성새로일하기센터전북특별자치도 완주군 봉동읍 과학로 850-15, 완주산업단지사무소 2층063-262-1780
4849무료완주장애인무료직업소개소전북특별자치도 완주군 봉동읍 낙평장기로 22063-261-7801