Overview

Dataset statistics

Number of variables5
Number of observations112
Missing cells83
Missing cells (%)14.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory42.2 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description충청남도 아산시 관내 직업소개소 현황으로 직업소개소명, 대표자명, 사업소전화번호 , 유무료구분 등의 정보가 포함됩니다.
URLhttps://www.data.go.kr/data/3069711/fileData.do

Alerts

유무료구분 is highly imbalanced (69.9%)Imbalance
사업소전화번호 has 83 (74.1%) missing valuesMissing
순번 has unique valuesUnique
직업소개소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:13:09.288166
Analysis finished2023-12-12 13:13:09.842407
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct112
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57.6875
Minimum1
Maximum115
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T22:13:09.927132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.55
Q129.75
median57.5
Q386.25
95-th percentile109.45
Maximum115
Range114
Interquartile range (IQR)56.5

Descriptive statistics

Standard deviation33.228402
Coefficient of variation (CV)0.57600697
Kurtosis-1.1753173
Mean57.6875
Median Absolute Deviation (MAD)28.5
Skewness0.012661641
Sum6461
Variance1104.1267
MonotonicityStrictly increasing
2023-12-12T22:13:10.066769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
59 1
 
0.9%
86 1
 
0.9%
85 1
 
0.9%
83 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
Other values (102) 102
91.1%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
115 1
0.9%
114 1
0.9%
113 1
0.9%
112 1
0.9%
111 1
0.9%
110 1
0.9%
109 1
0.9%
108 1
0.9%
107 1
0.9%
106 1
0.9%

유무료구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
유료
106 
무료
 
6

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 106
94.6%
무료 6
 
5.4%

Length

2023-12-12T22:13:10.194839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:13:10.290433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 106
94.6%
무료 6
 
5.4%

직업소개소명
Text

UNIQUE 

Distinct112
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T22:13:10.585948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length6.1696429
Min length2

Characters and Unicode

Total characters691
Distinct characters175
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)100.0%

Sample

1st row한길인력
2nd row아산전기인력
3rd row영림인력
4th row라온컨설팅
5th row조은인력
ValueCountFrequency (%)
주식회사 2
 
1.7%
아산인력 2
 
1.7%
한길인력 1
 
0.8%
창조인력사무소 1
 
0.8%
남산서비스 1
 
0.8%
신창인력개발 1
 
0.8%
로얄맘 1
 
0.8%
에이치티인력 1
 
0.8%
유성엔지니어링 1
 
0.8%
dh테크두현산업인력 1
 
0.8%
Other values (109) 109
90.1%
2023-12-12T22:13:11.026635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
11.4%
73
 
10.6%
29
 
4.2%
21
 
3.0%
17
 
2.5%
15
 
2.2%
15
 
2.2%
11
 
1.6%
11
 
1.6%
10
 
1.4%
Other values (165) 410
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 653
94.5%
Space Separator 9
 
1.3%
Uppercase Letter 9
 
1.3%
Open Punctuation 8
 
1.2%
Close Punctuation 8
 
1.2%
Decimal Number 2
 
0.3%
Other Punctuation 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
12.1%
73
 
11.2%
29
 
4.4%
21
 
3.2%
17
 
2.6%
15
 
2.3%
15
 
2.3%
11
 
1.7%
11
 
1.7%
10
 
1.5%
Other values (152) 372
57.0%
Uppercase Letter
ValueCountFrequency (%)
H 2
22.2%
O 2
22.2%
C 2
22.2%
D 1
11.1%
E 1
11.1%
K 1
11.1%
Decimal Number
ValueCountFrequency (%)
5 1
50.0%
0 1
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 653
94.5%
Common 29
 
4.2%
Latin 9
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
12.1%
73
 
11.2%
29
 
4.4%
21
 
3.2%
17
 
2.6%
15
 
2.3%
15
 
2.3%
11
 
1.7%
11
 
1.7%
10
 
1.5%
Other values (152) 372
57.0%
Common
ValueCountFrequency (%)
9
31.0%
( 8
27.6%
) 8
27.6%
. 1
 
3.4%
5 1
 
3.4%
- 1
 
3.4%
0 1
 
3.4%
Latin
ValueCountFrequency (%)
H 2
22.2%
O 2
22.2%
C 2
22.2%
D 1
11.1%
E 1
11.1%
K 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 653
94.5%
ASCII 38
 
5.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
79
 
12.1%
73
 
11.2%
29
 
4.4%
21
 
3.2%
17
 
2.6%
15
 
2.3%
15
 
2.3%
11
 
1.7%
11
 
1.7%
10
 
1.5%
Other values (152) 372
57.0%
ASCII
ValueCountFrequency (%)
9
23.7%
( 8
21.1%
) 8
21.1%
H 2
 
5.3%
O 2
 
5.3%
C 2
 
5.3%
D 1
 
2.6%
. 1
 
2.6%
5 1
 
2.6%
- 1
 
2.6%
Other values (3) 3
 
7.9%

사업소전화번호
Text

MISSING 

Distinct29
Distinct (%)100.0%
Missing83
Missing (%)74.1%
Memory size1.0 KiB
2023-12-12T22:13:11.254281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.034483
Min length12

Characters and Unicode

Total characters349
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row041-533-6048
2nd row041-548-0272
3rd row041-549-1410
4th row041-592-1700
5th row041-531-1198
ValueCountFrequency (%)
041-533-6048 1
 
3.4%
041-549-2111 1
 
3.4%
041-549-2580 1
 
3.4%
041-532-7273 1
 
3.4%
041-531-1682 1
 
3.4%
041--547-9444 1
 
3.4%
041-333-1989 1
 
3.4%
041-545-6373 1
 
3.4%
032-227-1112 1
 
3.4%
041-533-1330 1
 
3.4%
Other values (19) 19
65.5%
2023-12-12T22:13:11.707336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 59
16.9%
- 59
16.9%
4 57
16.3%
0 41
11.7%
5 37
10.6%
3 28
8.0%
2 21
 
6.0%
6 15
 
4.3%
8 13
 
3.7%
9 11
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 290
83.1%
Dash Punctuation 59
 
16.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 59
20.3%
4 57
19.7%
0 41
14.1%
5 37
12.8%
3 28
9.7%
2 21
 
7.2%
6 15
 
5.2%
8 13
 
4.5%
9 11
 
3.8%
7 8
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 349
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 59
16.9%
- 59
16.9%
4 57
16.3%
0 41
11.7%
5 37
10.6%
3 28
8.0%
2 21
 
6.0%
6 15
 
4.3%
8 13
 
3.7%
9 11
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 349
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 59
16.9%
- 59
16.9%
4 57
16.3%
0 41
11.7%
5 37
10.6%
3 28
8.0%
2 21
 
6.0%
6 15
 
4.3%
8 13
 
3.7%
9 11
 
3.2%
Distinct110
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T22:13:12.178625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length40
Mean length27.571429
Min length19

Characters and Unicode

Total characters3088
Distinct characters158
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)96.4%

Sample

1st row충청남도 아산시 배방읍 모산로 159
2nd row충청남도 아산시 신창면 행목로 76
3rd row충청남도 아산시 둔포면 충무로 1761
4th row충청남도 아산시 온천대로 1626-4. 201호 (풍기동)
5th row충청남도 아산시 둔포면 둔포중앙로 156
ValueCountFrequency (%)
충청남도 112
 
17.0%
아산시 111
 
16.9%
온천동 32
 
4.9%
배방읍 19
 
2.9%
둔포면 18
 
2.7%
2층 17
 
2.6%
1층 14
 
2.1%
충무로 11
 
1.7%
온천대로 10
 
1.5%
탕정면 6
 
0.9%
Other values (212) 307
46.7%
2023-12-12T22:13:12.819641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
545
 
17.6%
1 156
 
5.1%
136
 
4.4%
129
 
4.2%
124
 
4.0%
124
 
4.0%
119
 
3.9%
114
 
3.7%
114
 
3.7%
102
 
3.3%
Other values (148) 1425
46.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1789
57.9%
Space Separator 545
 
17.6%
Decimal Number 542
 
17.6%
Other Punctuation 67
 
2.2%
Close Punctuation 56
 
1.8%
Open Punctuation 56
 
1.8%
Dash Punctuation 32
 
1.0%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
136
 
7.6%
129
 
7.2%
124
 
6.9%
124
 
6.9%
119
 
6.7%
114
 
6.4%
114
 
6.4%
102
 
5.7%
65
 
3.6%
53
 
3.0%
Other values (132) 709
39.6%
Decimal Number
ValueCountFrequency (%)
1 156
28.8%
2 82
15.1%
0 47
 
8.7%
3 44
 
8.1%
4 44
 
8.1%
6 43
 
7.9%
7 37
 
6.8%
9 34
 
6.3%
5 30
 
5.5%
8 25
 
4.6%
Space Separator
ValueCountFrequency (%)
545
100.0%
Other Punctuation
ValueCountFrequency (%)
. 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1789
57.9%
Common 1298
42.0%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
136
 
7.6%
129
 
7.2%
124
 
6.9%
124
 
6.9%
119
 
6.7%
114
 
6.4%
114
 
6.4%
102
 
5.7%
65
 
3.6%
53
 
3.0%
Other values (132) 709
39.6%
Common
ValueCountFrequency (%)
545
42.0%
1 156
 
12.0%
2 82
 
6.3%
. 67
 
5.2%
) 56
 
4.3%
( 56
 
4.3%
0 47
 
3.6%
3 44
 
3.4%
4 44
 
3.4%
6 43
 
3.3%
Other values (5) 158
 
12.2%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1789
57.9%
ASCII 1299
42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
545
42.0%
1 156
 
12.0%
2 82
 
6.3%
. 67
 
5.2%
) 56
 
4.3%
( 56
 
4.3%
0 47
 
3.6%
3 44
 
3.4%
4 44
 
3.4%
6 43
 
3.3%
Other values (6) 159
 
12.2%
Hangul
ValueCountFrequency (%)
136
 
7.6%
129
 
7.2%
124
 
6.9%
124
 
6.9%
119
 
6.7%
114
 
6.4%
114
 
6.4%
102
 
5.7%
65
 
3.6%
53
 
3.0%
Other values (132) 709
39.6%

Interactions

2023-12-12T22:13:09.543797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:13:12.919625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분사업소전화번호
순번1.0000.1891.000
유무료구분0.1891.0001.000
사업소전화번호1.0001.0001.000
2023-12-12T22:13:13.035223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분
순번1.0000.137
유무료구분0.1371.000

Missing values

2023-12-12T22:13:09.663717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:13:09.804899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번유무료구분직업소개소명사업소전화번호사업소도로명주소
01유료한길인력<NA>충청남도 아산시 배방읍 모산로 159
12유료아산전기인력<NA>충청남도 아산시 신창면 행목로 76
23유료영림인력<NA>충청남도 아산시 둔포면 충무로 1761
34유료라온컨설팅<NA>충청남도 아산시 온천대로 1626-4. 201호 (풍기동)
45유료조은인력<NA>충청남도 아산시 둔포면 둔포중앙로 156
56유료아산.천안인력개발<NA>충청남도 아산시 배방읍 배방로187번길 16-2
67유료대영인력<NA>충청남도 아산시 삼동로86번길 25-1(권곡동)
78유료정석인력<NA>충청남도 아산시 번영로 129(온천동)
89유료제이엠인력<NA>충청남도 아산시 둔포면 둔포면로17번길 20-1
910유료대동인력041-533-6048충청남도 아산시 온천대로 1425. 모범세탁 (온천동)
순번유무료구분직업소개소명사업소전화번호사업소도로명주소
102106유료(주)온양인력소개소<NA>충청남도 아산시 중앙로 24. 지강학원 (온천동)
103107유료일터사랑<NA>충청남도 아산시 남산로 98 (온천동)
104108유료미래여성전문인력041--547-9444충청남도 아산시 충무로20번길 6 (온천동)
105109유료모산직업소개소 인력<NA>충청남도 아산시 온천대로 1559. 대웅식당 (온천동)
106110유료거산인력041-531-1682충청남도 아산시 둔포면 충무로 1804
107111유료고려직업소개소041-532-7273충청남도 아산시 둔포면 충무로 1734
108112유료태평양건설직업컨설팅<NA>충청남도 아산시 삼동로86번길 16 (모종동)
109113무료한국장애인사랑나눔협회 아산시장애인일자리지원센터041-549-2580충청남도 아산시 남산로 96-14. 1층 101호 (온천동)
110114유료충남인력<NA>충청남도 아산시 온양역길 146-2. 2층 (온천동)
111115무료(사)충청남도지체장애인협회아산지회041-546-1515충청남도 아산시 번영로143번길 36 (권곡동)