Overview

Dataset statistics

Number of variables5
Number of observations57
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory43.2 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description경북 영천시에 등록되어 있는 유무료 직업소개소 현황으로 연번, 요금, 상호, 지번주소, 도로명주소 등의 데이터를 제공합니다.
Author경상북도 영천시
URLhttps://www.data.go.kr/data/15084407/fileData.do

Alerts

유무료구분 is highly overall correlated with 전화번호High correlation
전화번호 is highly overall correlated with 유무료구분High correlation
유무료구분 is highly imbalanced (63.3%)Imbalance
순번 has unique valuesUnique
사업체명 has unique valuesUnique
사업소도로명주소 has unique valuesUnique

Reproduction

Analysis started2024-03-14 09:33:58.785865
Analysis finished2024-03-14 09:33:59.909814
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29
Minimum1
Maximum57
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size641.0 B
2024-03-14T18:34:00.115538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.8
Q115
median29
Q343
95-th percentile54.2
Maximum57
Range56
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.598193
Coefficient of variation (CV)0.57235147
Kurtosis-1.2
Mean29
Median Absolute Deviation (MAD)14
Skewness0
Sum1653
Variance275.5
MonotonicityStrictly increasing
2024-03-14T18:34:00.535482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
44 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (47) 47
82.5%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
57 1
1.8%
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%

유무료구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size584.0 B
유료
53 
무료
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유료
2nd row유료
3rd row무료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 53
93.0%
무료 4
 
7.0%

Length

2024-03-14T18:34:00.833611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:34:00.996643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 53
93.0%
무료 4
 
7.0%

사업체명
Text

UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size584.0 B
2024-03-14T18:34:01.960866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length6.7017544
Min length2

Characters and Unicode

Total characters382
Distinct characters131
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)100.0%

Sample

1st row모두의 인력
2nd row화진개발
3rd row(재)경북하이브리드부품연구원
4th rowY.C 산업
5th row그린용역
ValueCountFrequency (%)
모두의 1
 
1.6%
별빛인력개발 1
 
1.6%
삼성인력 1
 
1.6%
서민용역(산업개발 1
 
1.6%
대림유료직업소개소 1
 
1.6%
대흥산업개발 1
 
1.6%
새영천용역 1
 
1.6%
88인력 1
 
1.6%
글로벌중앙 1
 
1.6%
봄빛인력개발 1
 
1.6%
Other values (53) 53
84.1%
2024-03-14T18:34:03.127966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
7.9%
27
 
7.1%
25
 
6.5%
22
 
5.8%
16
 
4.2%
15
 
3.9%
11
 
2.9%
9
 
2.4%
9
 
2.4%
7
 
1.8%
Other values (121) 211
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 360
94.2%
Space Separator 6
 
1.6%
Uppercase Letter 5
 
1.3%
Decimal Number 4
 
1.0%
Close Punctuation 3
 
0.8%
Open Punctuation 3
 
0.8%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
8.3%
27
 
7.5%
25
 
6.9%
22
 
6.1%
16
 
4.4%
15
 
4.2%
11
 
3.1%
9
 
2.5%
9
 
2.5%
7
 
1.9%
Other values (110) 189
52.5%
Uppercase Letter
ValueCountFrequency (%)
C 2
40.0%
Y 1
20.0%
K 1
20.0%
O 1
20.0%
Decimal Number
ValueCountFrequency (%)
8 2
50.0%
1 1
25.0%
2 1
25.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 360
94.2%
Common 17
 
4.5%
Latin 5
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
8.3%
27
 
7.5%
25
 
6.9%
22
 
6.1%
16
 
4.4%
15
 
4.2%
11
 
3.1%
9
 
2.5%
9
 
2.5%
7
 
1.9%
Other values (110) 189
52.5%
Common
ValueCountFrequency (%)
6
35.3%
) 3
17.6%
( 3
17.6%
8 2
 
11.8%
. 1
 
5.9%
1 1
 
5.9%
2 1
 
5.9%
Latin
ValueCountFrequency (%)
C 2
40.0%
Y 1
20.0%
K 1
20.0%
O 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 360
94.2%
ASCII 22
 
5.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
8.3%
27
 
7.5%
25
 
6.9%
22
 
6.1%
16
 
4.4%
15
 
4.2%
11
 
3.1%
9
 
2.5%
9
 
2.5%
7
 
1.9%
Other values (110) 189
52.5%
ASCII
ValueCountFrequency (%)
6
27.3%
) 3
13.6%
( 3
13.6%
8 2
 
9.1%
C 2
 
9.1%
. 1
 
4.5%
Y 1
 
4.5%
K 1
 
4.5%
O 1
 
4.5%
1 1
 
4.5%

전화번호
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)29.8%
Missing0
Missing (%)0.0%
Memory size584.0 B
37 
<NA>
054-333-5335
 
1
 
1
054-335-5080
 
1
Other values (12)
12 

Length

Max length13
Median length1
Mean length4
Min length1

Unique

Unique15 ?
Unique (%)26.3%

Sample

1st row
2nd row
3rd row054-330-8000
4th row
5th row

Common Values

ValueCountFrequency (%)
37
64.9%
<NA> 5
 
8.8%
054-333-5335 1
 
1.8%
1
 
1.8%
054-335-5080 1
 
1.8%
054-338-2100 1
 
1.8%
070-4233-9090 1
 
1.8%
054-338-9909 1
 
1.8%
054-330-8000 1
 
1.8%
054-334-5419 1
 
1.8%
Other values (7) 7
 
12.3%

Length

2024-03-14T18:34:03.362159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5
26.3%
054-333-5335 1
 
5.3%
054-335-5080 1
 
5.3%
054-338-2100 1
 
5.3%
070-4233-9090 1
 
5.3%
054-338-9909 1
 
5.3%
054-330-8000 1
 
5.3%
054-334-5419 1
 
5.3%
054-337-3223 1
 
5.3%
054-335-4444 1
 
5.3%
Other values (5) 5
26.3%
Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size584.0 B
2024-03-14T18:34:04.474965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length23.929825
Min length19

Characters and Unicode

Total characters1364
Distinct characters103
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)100.0%

Sample

1st row경상북도 영천시 운동장로 14 2층 에이 25호 (교촌동)
2nd row경상북도 영천시 역전로 11 (완산동)
3rd row경상북도 영천시 괴연1길 24-24 (괴연동)
4th row경상북도 영천시 최무선로 280 3층 (과전동)
5th row경상북도 영천시 중앙동3길 102 (문내동)
ValueCountFrequency (%)
영천시 58
18.7%
경상북도 57
18.4%
야사동 14
 
4.5%
장수로 7
 
2.3%
2층 7
 
2.3%
금호읍 6
 
1.9%
1층 6
 
1.9%
완산동 6
 
1.9%
호국로 5
 
1.6%
화룡동 4
 
1.3%
Other values (115) 140
45.2%
2024-03-14T18:34:05.913569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
273
20.0%
61
 
4.5%
61
 
4.5%
60
 
4.4%
59
 
4.3%
59
 
4.3%
58
 
4.3%
57
 
4.2%
56
 
4.1%
) 45
 
3.3%
Other values (93) 575
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 801
58.7%
Space Separator 273
 
20.0%
Decimal Number 186
 
13.6%
Close Punctuation 45
 
3.3%
Open Punctuation 45
 
3.3%
Dash Punctuation 13
 
1.0%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
60
 
7.5%
59
 
7.4%
59
 
7.4%
58
 
7.2%
57
 
7.1%
56
 
7.0%
35
 
4.4%
22
 
2.7%
Other values (78) 273
34.1%
Decimal Number
ValueCountFrequency (%)
1 45
24.2%
2 32
17.2%
6 20
10.8%
4 19
10.2%
3 18
 
9.7%
7 15
 
8.1%
0 11
 
5.9%
5 9
 
4.8%
9 9
 
4.8%
8 8
 
4.3%
Space Separator
ValueCountFrequency (%)
273
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 801
58.7%
Common 562
41.2%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
60
 
7.5%
59
 
7.4%
59
 
7.4%
58
 
7.2%
57
 
7.1%
56
 
7.0%
35
 
4.4%
22
 
2.7%
Other values (78) 273
34.1%
Common
ValueCountFrequency (%)
273
48.6%
) 45
 
8.0%
1 45
 
8.0%
( 45
 
8.0%
2 32
 
5.7%
6 20
 
3.6%
4 19
 
3.4%
3 18
 
3.2%
7 15
 
2.7%
- 13
 
2.3%
Other values (4) 37
 
6.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 801
58.7%
ASCII 563
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
273
48.5%
) 45
 
8.0%
1 45
 
8.0%
( 45
 
8.0%
2 32
 
5.7%
6 20
 
3.6%
4 19
 
3.4%
3 18
 
3.2%
7 15
 
2.7%
- 13
 
2.3%
Other values (5) 38
 
6.7%
Hangul
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
60
 
7.5%
59
 
7.4%
59
 
7.4%
58
 
7.2%
57
 
7.1%
56
 
7.0%
35
 
4.4%
22
 
2.7%
Other values (78) 273
34.1%

Interactions

2024-03-14T18:33:59.145462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T18:34:06.198485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분사업체명전화번호사업소도로명주소
순번1.0000.2681.0000.2571.000
유무료구분0.2681.0001.0000.9291.000
사업체명1.0001.0001.0001.0001.000
전화번호0.2570.9291.0001.0001.000
사업소도로명주소1.0001.0001.0001.0001.000
2024-03-14T18:34:06.510239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유무료구분전화번호
유무료구분1.0000.672
전화번호0.6721.000
2024-03-14T18:34:06.759059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분전화번호
순번1.0000.2060.000
유무료구분0.2061.0000.672
전화번호0.0000.6721.000

Missing values

2024-03-14T18:33:59.469507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T18:33:59.785406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번유무료구분사업체명전화번호사업소도로명주소
01유료모두의 인력경상북도 영천시 운동장로 14 2층 에이 25호 (교촌동)
12유료화진개발경상북도 영천시 역전로 11 (완산동)
23무료(재)경북하이브리드부품연구원054-330-8000경상북도 영천시 괴연1길 24-24 (괴연동)
34유료Y.C 산업경상북도 영천시 최무선로 280 3층 (과전동)
45유료그린용역경상북도 영천시 중앙동3길 102 (문내동)
56유료만평계절인력경상북도 영천시 신녕면 장수로 1638 2층
67유료황소조경인력개발경상북도 영천시 상록5길 61-4 2층 (야사동)
78유료행복인력개발경상북도 영천시 한방로 115 (작산동)
89유료화랑인력개발경상북도 영천시 도동구역길 183-1(도동)
910유료금오산업경상북도 영천시 청산길 17 무지개타운 제104동 나-7호 (문외동)
순번유무료구분사업체명전화번호사업소도로명주소
4748유료세원조경인력개발054-332-9525경상북도 영천시 천문로 266 (완산동)
4849유료영천인력산업개발경상북도 영천시 영화로 373 (망정동)
4950유료새가나개발직업소개소054-334-5419경상북도 영천시 운동장로 14 (교촌동)
5051유료현대인력054-337-3223경상북도 영천시 호국로 39 (야사동)
5152유료시민산업개발054-335-4444경상북도 영천시 문화새길 25 5동 2호 (야사동 문화아파트상가)
5253무료(사)한국농아인협회 경상북도협회 영천시지회054-331-0350경상북도 영천시 완산6길 19-6 (완산동)
5354유료영동산업개발054-332-7100경상북도 영천시 동문길 66 (문내동)
5455무료영천시장애인종합복지관054-333-3535경상북도 영천시 보목2길 10 (야사동)
5556유료태영개발 직업소개소경상북도 영천시 강변로 70 (금노동)
5657유료매일유료직업소개소054-337-5311경상북도 영천시 안야사2길 7 (야사동)