Overview

Dataset statistics

Number of variables6
Number of observations73
Missing cells41
Missing cells (%)9.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory50.8 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description대구광역시 북구의 직업소개소 현황 데이터로 직업소개소의 유무료구분, 사업자명, 도로명주소(위치), 전화번호 정보를 제공합니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15113964/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
유무료구분 is highly imbalanced (50.1%)Imbalance
전화번호 has 41 (56.2%) missing valuesMissing
연번 has unique valuesUnique
사업자명 has unique valuesUnique

Reproduction

Analysis started2024-04-21 11:27:36.920455
Analysis finished2024-04-21 11:27:38.424375
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37
Minimum1
Maximum73
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size785.0 B
2024-04-21T20:27:38.553363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.6
Q119
median37
Q355
95-th percentile69.4
Maximum73
Range72
Interquartile range (IQR)36

Descriptive statistics

Standard deviation21.217131
Coefficient of variation (CV)0.57343598
Kurtosis-1.2
Mean37
Median Absolute Deviation (MAD)18
Skewness0
Sum2701
Variance450.16667
MonotonicityStrictly increasing
2024-04-21T20:27:38.840892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
56 1
 
1.4%
54 1
 
1.4%
53 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
Other values (63) 63
86.3%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
73 1
1.4%
72 1
1.4%
71 1
1.4%
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%

유무료구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size712.0 B
유료
65 
무료

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 65
89.0%
무료 8
 
11.0%

Length

2024-04-21T20:27:39.081871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T20:27:39.261405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 65
89.0%
무료 8
 
11.0%

사업자명
Text

UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size712.0 B
2024-04-21T20:27:40.046851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length14
Mean length7.7534247
Min length2

Characters and Unicode

Total characters566
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row네이버
2nd row글로벌 컨설팅(Global Consulting)
3rd row일오삼 인력 개발
4th row미래인력
5th row대구여성인력센터 홈사랑서비스
ValueCountFrequency (%)
유료직업소개소 4
 
4.4%
직업소개소 2
 
2.2%
주식회사 2
 
2.2%
소야유료직업소개소 1
 
1.1%
샤넬 1
 
1.1%
대승인력개발 1
 
1.1%
우림간병센터 1
 
1.1%
강북1번직업소개소 1
 
1.1%
하나로인력직업소개소 1
 
1.1%
㈜일로이룸 1
 
1.1%
Other values (75) 75
83.3%
2024-04-21T20:27:41.096824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
7.4%
31
 
5.5%
26
 
4.6%
25
 
4.4%
22
 
3.9%
20
 
3.5%
17
 
3.0%
14
 
2.5%
14
 
2.5%
12
 
2.1%
Other values (172) 343
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 506
89.4%
Space Separator 17
 
3.0%
Lowercase Letter 14
 
2.5%
Open Punctuation 8
 
1.4%
Close Punctuation 8
 
1.4%
Uppercase Letter 6
 
1.1%
Decimal Number 5
 
0.9%
Other Symbol 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
8.3%
31
 
6.1%
26
 
5.1%
25
 
4.9%
22
 
4.3%
20
 
4.0%
14
 
2.8%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (148) 289
57.1%
Lowercase Letter
ValueCountFrequency (%)
l 3
21.4%
o 2
14.3%
n 2
14.3%
g 1
 
7.1%
i 1
 
7.1%
t 1
 
7.1%
u 1
 
7.1%
s 1
 
7.1%
a 1
 
7.1%
b 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
Y 1
16.7%
C 1
16.7%
K 1
16.7%
B 1
16.7%
Decimal Number
ValueCountFrequency (%)
3 2
40.0%
1 1
20.0%
5 1
20.0%
6 1
20.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 507
89.6%
Common 39
 
6.9%
Latin 20
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
8.3%
31
 
6.1%
26
 
5.1%
25
 
4.9%
22
 
4.3%
20
 
3.9%
14
 
2.8%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (149) 290
57.2%
Latin
ValueCountFrequency (%)
l 3
15.0%
o 2
 
10.0%
n 2
 
10.0%
G 2
 
10.0%
Y 1
 
5.0%
g 1
 
5.0%
i 1
 
5.0%
t 1
 
5.0%
u 1
 
5.0%
s 1
 
5.0%
Other values (5) 5
25.0%
Common
ValueCountFrequency (%)
17
43.6%
( 8
20.5%
) 8
20.5%
3 2
 
5.1%
1 1
 
2.6%
5 1
 
2.6%
. 1
 
2.6%
6 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 506
89.4%
ASCII 59
 
10.4%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
8.3%
31
 
6.1%
26
 
5.1%
25
 
4.9%
22
 
4.3%
20
 
4.0%
14
 
2.8%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (148) 289
57.1%
ASCII
ValueCountFrequency (%)
17
28.8%
( 8
13.6%
) 8
13.6%
l 3
 
5.1%
o 2
 
3.4%
n 2
 
3.4%
G 2
 
3.4%
3 2
 
3.4%
Y 1
 
1.7%
1 1
 
1.7%
Other values (13) 13
22.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct70
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size712.0 B
2024-04-21T20:27:42.070702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length41
Mean length29.739726
Min length22

Characters and Unicode

Total characters2171
Distinct characters104
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)91.8%

Sample

1st row대구광역시 북구 경진로1길 55-1, 1층 (복현동)
2nd row대구광역시 북구 원대로 113, 5층 (노원동1가)
3rd row대구광역시 북구 산격로1길 21, 지하1층 (산격동)
4th row대구광역시 북구 노원로10길 40, 상가동 109호 (노원동3가, 대구노원한신더휴)
5th row대구광역시 북구 신암로 111, 303호 (대현동)
ValueCountFrequency (%)
대구광역시 73
 
16.0%
북구 73
 
16.0%
2층 26
 
5.7%
산격동 14
 
3.1%
복현동 12
 
2.6%
1층 8
 
1.8%
동북로 8
 
1.8%
칠성동2가 6
 
1.3%
3층 6
 
1.3%
읍내동 6
 
1.3%
Other values (148) 225
49.2%
2024-04-21T20:27:43.285364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
385
 
17.7%
151
 
7.0%
102
 
4.7%
97
 
4.5%
2 87
 
4.0%
82
 
3.8%
74
 
3.4%
) 73
 
3.4%
73
 
3.4%
( 73
 
3.4%
Other values (94) 974
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1200
55.3%
Space Separator 385
 
17.7%
Decimal Number 358
 
16.5%
Close Punctuation 73
 
3.4%
Open Punctuation 73
 
3.4%
Other Punctuation 70
 
3.2%
Dash Punctuation 11
 
0.5%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
151
 
12.6%
102
 
8.5%
97
 
8.1%
82
 
6.8%
74
 
6.2%
73
 
6.1%
73
 
6.1%
72
 
6.0%
51
 
4.2%
26
 
2.2%
Other values (78) 399
33.2%
Decimal Number
ValueCountFrequency (%)
2 87
24.3%
1 72
20.1%
3 47
13.1%
0 41
11.5%
5 26
 
7.3%
7 25
 
7.0%
8 18
 
5.0%
4 17
 
4.7%
6 14
 
3.9%
9 11
 
3.1%
Space Separator
ValueCountFrequency (%)
385
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Other Punctuation
ValueCountFrequency (%)
, 70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1200
55.3%
Common 970
44.7%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
151
 
12.6%
102
 
8.5%
97
 
8.1%
82
 
6.8%
74
 
6.2%
73
 
6.1%
73
 
6.1%
72
 
6.0%
51
 
4.2%
26
 
2.2%
Other values (78) 399
33.2%
Common
ValueCountFrequency (%)
385
39.7%
2 87
 
9.0%
) 73
 
7.5%
( 73
 
7.5%
1 72
 
7.4%
, 70
 
7.2%
3 47
 
4.8%
0 41
 
4.2%
5 26
 
2.7%
7 25
 
2.6%
Other values (5) 71
 
7.3%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1200
55.3%
ASCII 971
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
385
39.6%
2 87
 
9.0%
) 73
 
7.5%
( 73
 
7.5%
1 72
 
7.4%
, 70
 
7.2%
3 47
 
4.8%
0 41
 
4.2%
5 26
 
2.7%
7 25
 
2.6%
Other values (6) 72
 
7.4%
Hangul
ValueCountFrequency (%)
151
 
12.6%
102
 
8.5%
97
 
8.1%
82
 
6.8%
74
 
6.2%
73
 
6.1%
73
 
6.1%
72
 
6.0%
51
 
4.2%
26
 
2.2%
Other values (78) 399
33.2%

전화번호
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing41
Missing (%)56.2%
Memory size712.0 B
2024-04-21T20:27:43.975118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.90625
Min length9

Characters and Unicode

Total characters381
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row053-426-7996
2nd row053-312-1182
3rd row1600-9609
4th row053-341-0709
5th row053-257-0260
ValueCountFrequency (%)
053-426-7996 1
 
3.1%
053-312-1182 1
 
3.1%
053-311-8484 1
 
3.1%
053-953-3625 1
 
3.1%
053-381-1677 1
 
3.1%
053-423-6243 1
 
3.1%
053-353-8784 1
 
3.1%
053-383-6645 1
 
3.1%
053-313-9393 1
 
3.1%
053-323-7070 1
 
3.1%
Other values (22) 22
68.8%
2024-04-21T20:27:44.879833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 68
17.8%
- 63
16.5%
0 57
15.0%
5 49
12.9%
4 25
 
6.6%
1 24
 
6.3%
2 22
 
5.8%
9 22
 
5.8%
7 18
 
4.7%
8 17
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 318
83.5%
Dash Punctuation 63
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 68
21.4%
0 57
17.9%
5 49
15.4%
4 25
 
7.9%
1 24
 
7.5%
2 22
 
6.9%
9 22
 
6.9%
7 18
 
5.7%
8 17
 
5.3%
6 16
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 381
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 68
17.8%
- 63
16.5%
0 57
15.0%
5 49
12.9%
4 25
 
6.6%
1 24
 
6.3%
2 22
 
5.8%
9 22
 
5.8%
7 18
 
4.7%
8 17
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 381
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 68
17.8%
- 63
16.5%
0 57
15.0%
5 49
12.9%
4 25
 
6.6%
1 24
 
6.3%
2 22
 
5.8%
9 22
 
5.8%
7 18
 
4.7%
8 17
 
4.5%

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size712.0 B
Minimum2023-05-19 00:00:00
Maximum2023-05-19 00:00:00
2024-04-21T20:27:45.068039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T20:27:45.231623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-21T20:27:37.861145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T20:27:45.356469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번유무료구분사업자명도로명주소전화번호
연번1.0000.0001.0000.8281.000
유무료구분0.0001.0001.0001.0001.000
사업자명1.0001.0001.0001.0001.000
도로명주소0.8281.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
2024-04-21T20:27:45.518245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번유무료구분
연번1.0000.000
유무료구분0.0001.000

Missing values

2024-04-21T20:27:38.164326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T20:27:38.350961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번유무료구분사업자명도로명주소전화번호데이터 기준일자
01유료네이버대구광역시 북구 경진로1길 55-1, 1층 (복현동)<NA>2023-05-19
12유료글로벌 컨설팅(Global Consulting)대구광역시 북구 원대로 113, 5층 (노원동1가)<NA>2023-05-19
23유료일오삼 인력 개발대구광역시 북구 산격로1길 21, 지하1층 (산격동)<NA>2023-05-19
34유료미래인력대구광역시 북구 노원로10길 40, 상가동 109호 (노원동3가, 대구노원한신더휴)<NA>2023-05-19
45유료대구여성인력센터 홈사랑서비스대구광역시 북구 신암로 111, 303호 (대현동)<NA>2023-05-19
56유료신호등대구광역시 북구 대동로5길 6-2, 1층 (산격동)<NA>2023-05-19
67유료(주)타코마인력대구광역시 북구 대현로 62, 2층 (대현동)053-426-79962023-05-19
78유료에베레스트인력대구광역시 북구 환성정길 12, 2층 (서변동)<NA>2023-05-19
89유료레몬직업소개소대구광역시 북구 동천로24길 13, 3층 (동천동)<NA>2023-05-19
910유료도담간병지원센터대구광역시 북구 관음중앙로28길 31, 1층 (관음동)<NA>2023-05-19
연번유무료구분사업자명도로명주소전화번호데이터 기준일자
6364유료(주)제인브라더스대구광역시 북구 대학로 61, 4층 (산격동)<NA>2023-05-19
6465유료YG대구광역시 북구 대구체육관로 32, 2층 (산격동)<NA>2023-05-19
6566유료굳유료직업소개소대구광역시 북구 칠성남로37길 33, 2층 (칠성동2가)<NA>2023-05-19
6667무료새희망고용지원센터대구광역시 북구 칠성남로38길 22 (칠성동2가)053-423-62432023-05-19
6768유료성우인력개발대구광역시 북구 연암로 183, 209호 (산격동, 산격주공아파트)053-381-16772023-05-19
6869유료핑클 유료직업소개소대구광역시 북구 경진로 30, 2층 201호 (복현동)<NA>2023-05-19
6970유료대화건축인력대구광역시 북구 대현로 118-2 (대현동)053-953-36252023-05-19
7071유료자유산업개발대구광역시 북구 연암로 183, 7702동 205호 (산격동, 산격주공아파트)<NA>2023-05-19
7172유료대구어머니회유료직업소개소대구광역시 북구 원대로 80 (고성동3가)053-311-84842023-05-19
7273유료쌍경인력대구광역시 북구 침산남로 208, 2층 (침산동)053-358-30122023-05-19