Overview

Dataset statistics

Number of variables: 5
Number of observations: 152
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.2 KiB
Average record size in memory: 41.9 B
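
The overview counts can be recomputed directly from the rows. The sketch below is an illustration only: it uses the first five records from the Sample section as stand-in data, and the function name `dataset_statistics` is hypothetical rather than part of ydata-profiling. Duplicate rows are counted here as rows that repeat an earlier row, which may differ from the tool's own definition.

```python
# Stand-in data: the first five records listed in the report's Sample section.
COLUMNS = ["순번", "유무료구분", "직업소개소명", "운영상태", "사업소도로명주소"]
ROWS = [
    (1, "유료", "하나로간병협회", "영업중", "대구광역시 동구 효동로1길 12. 105호 (효목동)"),
    (2, "유료", "성원인력", "영업중", "대구광역시 동구 안심로 389-3. 제이에스의류관 B동 101호 (신서동)"),
    (3, "유료", "온누리 간병협회", "영업중", "대구광역시 동구 아양로15길 23. 1층 (신암동)"),
    (4, "유료", "커플유료직업소개소", "영업중", "대구광역시 동구 효신로 51. 2층 (신천동)"),
    (5, "유료", "넥스트엔(NextN)", "영업중", "대구광역시 동구 경안로 826. 4층 (각산동)"),
]


def dataset_statistics(columns, rows):
    """Recompute overview-style counts (hypothetical helper, not the tool's API)."""
    n_cells = len(columns) * len(rows)
    # Assumption: a cell is "missing" only when it is None.
    missing = sum(1 for row in rows for value in row if value is None)
    # Assumption: a duplicate is any row identical to an earlier row.
    duplicates = len(rows) - len(set(rows))
    return {
        "variables": len(columns),
        "observations": len(rows),
        "missing_cells": missing,
        "missing_cells_pct": 100.0 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100.0 * duplicates / len(rows) if rows else 0.0,
    }


stats = dataset_statistics(COLUMNS, ROWS)
print(stats)
```

On the full 152-row file the same function would be expected to report 5 variables and 152 observations, provided the file is loaded as one tuple per record.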

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

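Description: A listing of employment agencies (직업소개소) located in Dong-gu, Daegu. The data includes fields such as corporation name, paid/free classification, office name, operating status, and address.
Author: Dong-gu, Daegu Metropolitan City (대구광역시 동구)
URL: https://www.data.go.kr/data/3057602/fileData.do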

Alerts

운영상태 (operating status) has constant value "영업중" [Constant]
유무료구분 (paid/free classification) is highly imbalanced (65.0%) [Imbalance]
순번 (serial number) has unique values [Unique]
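
The 65.0% figure is consistent with an entropy-based imbalance score: one minus the Shannon entropy of the class frequencies divided by its maximum, log2 of the number of classes. The sketch below assumes that definition (it reproduces the reported value from the counts 142 and 10, but it is not taken from the tool's source), and all helper names are hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts; 0 means perfectly balanced."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


def is_constant(values):
    """True when every value in the column is identical."""
    return len(set(values)) == 1


def is_unique(values):
    """True when no value in the column repeats."""
    return len(set(values)) == len(values)


# Class counts reported for 유무료구분: 142 x 유료, 10 x 무료.
score = imbalance_score([142, 10])
print(f"imbalance: {score:.1%}")

constant_flag = is_constant(["영업중"] * 152)   # 운영상태
unique_flag = is_unique(list(range(1, 153)))    # 순번, assumed to be 1..152
print(constant_flag, unique_flag)
```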

Reproduction

Analysis started: 2024-04-06 08:46:11.871211
Analysis finished: 2024-04-06 08:46:13.510078
Duration: 1.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 152
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 76.5
Minimum: 1
Maximum: 152
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5th percentile: 8.55
Q1: 38.75
Median: 76.5
Q3: 114.25
95th percentile: 144.45
Maximum: 152
Range: 151
Interquartile range (IQR): 75.5
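
These quantiles match linear interpolation between order statistics when the column holds the integers 1 through 152, which is what the minimum, maximum, distinct count, and sum in this report imply. A minimal sketch under that assumption (the `quantile` helper is hypothetical, not the tool's implementation):

```python
def quantile(sorted_values, q):
    """Linear-interpolation quantile of an already sorted list, 0 <= q <= 1."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)                      # index of the value just below
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 153))  # assumption: 순번 is exactly 1..152

p05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(p05, q1, median, q3, p95, iqr)
```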

Descriptive statistics

Standard deviation: 44.022721
Coefficient of variation (CV): 0.57546041
Kurtosis: -1.2
Mean: 76.5
Median Absolute Deviation (MAD): 38
Skewness: 0
Sum: 11628
Variance: 1938
Monotonicity: Strictly increasing
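
Most of these figures can be checked with Python's standard `statistics` module. The sketch below again assumes the column holds the integers 1 through 152 and uses the sample (n - 1) variance, which is the convention that reproduces the reported 1938; variable names are illustrative.

```python
import statistics

values = list(range(1, 153))  # assumption: 순번 is exactly 1..152

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of |x - median(x)|.
mad = statistics.median(abs(v - median) for v in values)
# A symmetric distribution has a zero third central moment, hence zero skewness.
third_central_moment = sum((v - mean) ** 3 for v in values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```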
Histogram with fixed size bins (bins=50)

Common values

Value  Count  Frequency (%)
1  1  0.7%
106  1  0.7%
99  1  0.7%
100  1  0.7%
101  1  0.7%
102  1  0.7%
103  1  0.7%
104  1  0.7%
105  1  0.7%
107  1  0.7%
Other values (142)  142  93.4%

Minimum 10 values

Value  Count  Frequency (%)
1  1  0.7%
2  1  0.7%
3  1  0.7%
4  1  0.7%
5  1  0.7%
6  1  0.7%
7  1  0.7%
8  1  0.7%
9  1  0.7%
10  1  0.7%

Maximum 10 values

Value  Count  Frequency (%)
152  1  0.7%
151  1  0.7%
150  1  0.7%
149  1  0.7%
148  1  0.7%
147  1  0.7%
146  1  0.7%
145  1  0.7%
144  1  0.7%
143  1  0.7%

유무료구분 (paid/free classification)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
유료 (paid): 142
무료 (free): 10

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유료
2nd row: 유료
3rd row: 유료
4th row: 유료
5th row: 유료

Common Values

Value  Count  Frequency (%)
유료  142  93.4%
무료  10  6.6%

Length

Histogram of lengths of the category


직업소개소명 (agency name)
Text

Distinct: 151
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 27
Median length: 17
Mean length: 7.9934211
Min length: 1

Characters and Unicode

Total characters: 1215
Distinct characters: 242
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
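
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample agency name from this dataset; the `category_counts` helper and the name mapping are illustrative, and because the standard library does not expose the script or block properties, those are left out.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(text):
    """Count characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts = category_counts("넥스트엔(NextN)")
for name, count in counts.most_common():
    print(name, count)
```

Summing such counters over every value in a column gives a per-category total of the kind reported under "Most occurring categories".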

Unique

Unique: 150
Unique (%): 98.7%

Sample

1st row: 하나로간병협회
2nd row: 성원인력
3rd row: 온누리 간병협회
4th row: 커플유료직업소개소
5th row: 넥스트엔(NextN)

Value  Count  Frequency (%)
유료직업소개소  10  5.4%
직업소개소  8  4.3%
부림인력  2  1.1%
간병협회  2  1.1%
주식회사  2  1.1%
(blank)  1  0.5%
동구여성문화공간  1  0.5%
수성  1  0.5%
비타민유료직업소개소  1  0.5%
미래로인력개발  1  0.5%
Other values (157)  157  84.4%

Most occurring characters

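Value  Count  Frequency (%)
[character lost]  115  9.5%
[character lost]  76  6.3%
[character lost]  63  5.2%
[character lost]  59  4.9%
[character lost]  56  4.6%
[character lost]  53  4.4%
[character lost]  38  3.1%
[character lost]  37  3.0%
(space)  34  2.8%
[character lost]  28  2.3%
Other values (232)  656  54.0%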

Most occurring categories

Value  Count  Frequency (%)
Other Letter  1123  92.4%
Space Separator  34  2.8%
Uppercase Letter  18  1.5%
Close Punctuation  12  1.0%
Open Punctuation  12  1.0%
Lowercase Letter  11  0.9%
Decimal Number  3  0.2%
Dash Punctuation  2  0.2%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
[character lost]  115  10.2%
[character lost]  76  6.8%
[character lost]  63  5.6%
[character lost]  59  5.3%
[character lost]  56  5.0%
[character lost]  53  4.7%
[character lost]  38  3.4%
[character lost]  37  3.3%
[character lost]  28  2.5%
[character lost]  27  2.4%
Other values (205)  571  50.8%

Uppercase Letter
Value  Count  Frequency (%)
K  3  16.7%
H  2  11.1%
O  2  11.1%
A  2  11.1%
N  2  11.1%
E  2  11.1%
R  1  5.6%
S  1  5.6%
I  1  5.6%
V  1  5.6%

Lowercase Letter
Value  Count  Frequency (%)
e  3  27.3%
v  1  9.1%
s  1  9.1%
r  1  9.1%
i  1  9.1%
l  1  9.1%
a  1  9.1%
t  1  9.1%
x  1  9.1%

Decimal Number
Value  Count  Frequency (%)
3  1  33.3%
6  1  33.3%
5  1  33.3%

Space Separator
Value  Count  Frequency (%)
(space)  34  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  12  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  12  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  2  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  1123  92.4%
Common  63  5.2%
Latin  29  2.4%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
[character lost]  115  10.2%
[character lost]  76  6.8%
[character lost]  63  5.6%
[character lost]  59  5.3%
[character lost]  56  5.0%
[character lost]  53  4.7%
[character lost]  38  3.4%
[character lost]  37  3.3%
[character lost]  28  2.5%
[character lost]  27  2.4%
Other values (205)  571  50.8%

Latin
Value  Count  Frequency (%)
e  3  10.3%
K  3  10.3%
H  2  6.9%
O  2  6.9%
A  2  6.9%
N  2  6.9%
E  2  6.9%
v  1  3.4%
s  1  3.4%
r  1  3.4%
Other values (10)  10  34.5%

Common
Value  Count  Frequency (%)
(space)  34  54.0%
)  12  19.0%
(  12  19.0%
-  2  3.2%
3  1  1.6%
6  1  1.6%
5  1  1.6%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  1123  92.4%
ASCII  92  7.6%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
[character lost]  115  10.2%
[character lost]  76  6.8%
[character lost]  63  5.6%
[character lost]  59  5.3%
[character lost]  56  5.0%
[character lost]  53  4.7%
[character lost]  38  3.4%
[character lost]  37  3.3%
[character lost]  28  2.5%
[character lost]  27  2.4%
Other values (205)  571  50.8%

ASCII
Value  Count  Frequency (%)
(space)  34  37.0%
)  12  13.0%
(  12  13.0%
e  3  3.3%
K  3  3.3%
H  2  2.2%
-  2  2.2%
O  2  2.2%
A  2  2.2%
N  2  2.2%
Other values (17)  18  19.6%

운영상태 (operating status)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
영업중 (in operation): 152

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업중
2nd row: 영업중
3rd row: 영업중
4th row: 영업중
5th row: 영업중

Common Values

Value  Count  Frequency (%)
영업중  152  100.0%

Length

Histogram of lengths of the category


사업소도로명주소 (office road-name address)
Text

Distinct: 147
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 45
Median length: 42
Mean length: 28.927632
Min length: 20

Characters and Unicode

Total characters: 4397
Distinct characters: 169
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 143
Unique (%): 94.1%

Sample

1st row: 대구광역시 동구 효동로1길 12. 105호 (효목동)
2nd row: 대구광역시 동구 안심로 389-3. 제이에스의류관 B동 101호 (신서동)
3rd row: 대구광역시 동구 아양로15길 23. 1층 (신암동)
4th row: 대구광역시 동구 효신로 51. 2층 (신천동)
5th row: 대구광역시 동구 경안로 826. 4층 (각산동)

Value  Count  Frequency (%)
대구광역시  153  16.1%
동구  151  15.9%
2층  57  6.0%
신암동  42  4.4%
신천동  42  4.4%
1층  27  2.8%
3층  20  2.1%
아양로  18  1.9%
효목동  16  1.7%
동부로  10  1.1%
Other values (255)  414  43.6%
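
The table above reads like a word-frequency count over the address strings. The sketch below assumes tokens are split on whitespace and stripped of leading and trailing ASCII punctuation, which would explain why 신암동 appears without its parentheses. The `word_counts` helper and that tokenization rule are assumptions, and only the five sample addresses are used as input.

```python
import string
from collections import Counter

# The five sample addresses shown for this column.
ADDRESSES = [
    "대구광역시 동구 효동로1길 12. 105호 (효목동)",
    "대구광역시 동구 안심로 389-3. 제이에스의류관 B동 101호 (신서동)",
    "대구광역시 동구 아양로15길 23. 1층 (신암동)",
    "대구광역시 동구 효신로 51. 2층 (신천동)",
    "대구광역시 동구 경안로 826. 4층 (각산동)",
]


def word_counts(texts):
    """Count whitespace-separated tokens after trimming edge punctuation."""
    counter = Counter()
    for text in texts:
        for token in text.split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that were punctuation only
                counter[word] += 1
    return counter


counts = word_counts(ADDRESSES)
for word, count in counts.most_common(3):
    print(word, count)
```

Only punctuation at the edges of a token is removed, so a building number such as 389-3 keeps its inner hyphen.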

Most occurring characters

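Value  Count  Frequency (%)
(space)  798  18.1%
[character lost]  380  8.6%
[character lost]  314  7.1%
[character lost]  165  3.8%
2  156  3.5%
[character lost]  154  3.5%
[character lost]  154  3.5%
[character lost]  154  3.5%
[character lost]  153  3.5%
)  153  3.5%
Other values (159)  1816  41.3%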

Most occurring categories

Value  Count  Frequency (%)
Other Letter  2417  55.0%
Space Separator  798  18.1%
Decimal Number  694  15.8%
Close Punctuation  153  3.5%
Open Punctuation  153  3.5%
Other Punctuation  149  3.4%
Dash Punctuation  29  0.7%
Lowercase Letter  2  < 0.1%
Uppercase Letter  2  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
[character lost]  380  15.7%
[character lost]  314  13.0%
[character lost]  165  6.8%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  153  6.3%
[character lost]  117  4.8%
[character lost]  110  4.6%
[character lost]  52  2.2%
Other values (140)  664  27.5%

Decimal Number
Value  Count  Frequency (%)
2  156  22.5%
1  150  21.6%
3  86  12.4%
4  62  8.9%
0  56  8.1%
7  44  6.3%
6  43  6.2%
5  37  5.3%
8  35  5.0%
9  25  3.6%

Other Punctuation
Value  Count  Frequency (%)
.  148  99.3%
,  1  0.7%

Uppercase Letter
Value  Count  Frequency (%)
B  1  50.0%
A  1  50.0%

Space Separator
Value  Count  Frequency (%)
(space)  798  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  153  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  153  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  29  100.0%

Lowercase Letter
Value  Count  Frequency (%)
e  2  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  2417  55.0%
Common  1976  44.9%
Latin  4  0.1%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
[character lost]  380  15.7%
[character lost]  314  13.0%
[character lost]  165  6.8%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  153  6.3%
[character lost]  117  4.8%
[character lost]  110  4.6%
[character lost]  52  2.2%
Other values (140)  664  27.5%

Common
Value  Count  Frequency (%)
(space)  798  40.4%
2  156  7.9%
)  153  7.7%
(  153  7.7%
1  150  7.6%
.  148  7.5%
3  86  4.4%
4  62  3.1%
0  56  2.8%
7  44  2.2%
Other values (6)  170  8.6%

Latin
Value  Count  Frequency (%)
e  2  50.0%
B  1  25.0%
A  1  25.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  2417  55.0%
ASCII  1980  45.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  798  40.3%
2  156  7.9%
)  153  7.7%
(  153  7.7%
1  150  7.6%
.  148  7.5%
3  86  4.3%
4  62  3.1%
0  56  2.8%
7  44  2.2%
Other values (9)  174  8.8%

Hangul
Value  Count  Frequency (%)
[character lost]  380  15.7%
[character lost]  314  13.0%
[character lost]  165  6.8%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  154  6.4%
[character lost]  153  6.3%
[character lost]  117  4.8%
[character lost]  110  4.6%
[character lost]  52  2.2%
Other values (140)  664  27.5%

Interactions


Correlations

Variable  순번  유무료구분
순번  1.000  0.000
유무료구분  0.000  1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

(index)  순번  유무료구분  직업소개소명  운영상태  사업소도로명주소
0  1  유료  하나로간병협회  영업중  대구광역시 동구 효동로1길 12. 105호 (효목동)
1  2  유료  성원인력  영업중  대구광역시 동구 안심로 389-3. 제이에스의류관 B동 101호 (신서동)
2  3  유료  온누리 간병협회  영업중  대구광역시 동구 아양로15길 23. 1층 (신암동)
3  4  유료  커플유료직업소개소  영업중  대구광역시 동구 효신로 51. 2층 (신천동)
4  5  유료  넥스트엔(NextN)  영업중  대구광역시 동구 경안로 826. 4층 (각산동)
5  6  유료  대구 VISA KOREA  영업중  대구광역시 동구 효동로 3. 2층 (효목동)
6  7  무료  대구동구시니어클럽  영업중  대구광역시 동구 동촌로 325. 대한노인회 대구광역시 동구지회 4층 (신평동)
7  8  유료  두레인력개발  영업중  대구광역시 동구 아양로 174-1. 2층 (신암동)
8  9  유료  금조인력  영업중  대구광역시 동구 방천로1길 67-1. 2층 (불로동)
9  10  유료  우리인력  영업중  대구광역시 동구 안심로 448. 2층 (괴전동)

Last rows

(index)  순번  유무료구분  직업소개소명  운영상태  사업소도로명주소
142  143  유료  대림개발유료직업소개소  영업중  대구광역시 동구 효서로 37. 2층 (효목동)
143  144  유료  대경어머니회  영업중  대구광역시 동구 아양로 122. 3층 (신암동)
144  145  유료  일번지유료직업소개소  영업중  대구광역시 동구 큰고개로 67. 2층 (신암동)
145  146  유료  황금인력개발  영업중  대구광역시 동구 신암로20길 54. 4층 (신암동)
146  147  유료  경일건축인력  영업중  대구광역시 동구 송라로 23. 2층 (신천동)
147  148  유료  서울개발유료직업소개소  영업중  대구광역시 동구 장등로 93. 2층 (신천동)
148  149  무료  한국산업단지공단  영업중  대구광역시 동구 첨단로 39 (신서동)
149  150  유료  대동건축인력  영업중  대구광역시 동구 동북로 411. 2층 (신암동)
150  151  유료  우리어머니회  영업중  대구광역시 동구 장등로 3. 2층 (신천동)
151  152  유료  대구동구서구남구중구북구경산달서구수성구롯데어머니회  영업중  대구광역시 동구 아양로 117 (신암동)