Overview

Dataset statistics

Number of variables6
Number of observations91
Missing cells71
Missing cells (%)13.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory50.5 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description경상남도 거제시 직업소개소 현황(2022. 8. 8.기준)에 대한 데이터로, 관내 직업소개소 등록 현황을 안내드립니다.
URLhttps://www.data.go.kr/data/15103406/fileData.do

Alerts

기준일 has constant value ""Constant
유무료구분 is highly imbalanced (91.3%)Imbalance
전화번호 has 71 (78.0%) missing valuesMissing
순번 has unique valuesUnique
법인명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:22:38.877038
Analysis finished2023-12-12 20:22:39.663576
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct91
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46
Minimum1
Maximum91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size951.0 B
2023-12-13T05:22:39.773168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.5
Q123.5
median46
Q368.5
95-th percentile86.5
Maximum91
Range90
Interquartile range (IQR)45

Descriptive statistics

Standard deviation26.41338
Coefficient of variation (CV)0.57420392
Kurtosis-1.2
Mean46
Median Absolute Deviation (MAD)23
Skewness0
Sum4186
Variance697.66667
MonotonicityStrictly increasing
2023-12-13T05:22:39.966220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
59 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
Other values (81) 81
89.0%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%

유무료구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size860.0 B
유료
90 
무료
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row유료
2nd row유료
3rd row유료
4th row유료
5th row유료

Common Values

ValueCountFrequency (%)
유료 90
98.9%
무료 1
 
1.1%

Length

2023-12-13T05:22:40.085204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:22:40.178362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유료 90
98.9%
무료 1
 
1.1%

법인명
Text

UNIQUE 

Distinct91
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-13T05:22:40.391459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length11
Mean length7
Min length2

Characters and Unicode

Total characters637
Distinct characters170
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)100.0%

Sample

1st row매일인력
2nd row대지인력
3rd row고현인력
4th row주식회사 코러스인터네셔널
5th row이피에이코비스코리아(주)
ValueCountFrequency (%)
직업소개소 9
 
7.8%
주식회사 5
 
4.3%
인력개발 2
 
1.7%
파트너스 1
 
0.9%
거제가사원 1
 
0.9%
으뜸인력가사원 1
 
0.9%
조선플랜트엔지니어링 1
 
0.9%
대우인력 1
 
0.9%
여우직업소개소 1
 
0.9%
통합인력 1
 
0.9%
Other values (92) 92
80.0%
2023-12-13T05:22:40.812598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
 
8.9%
36
 
5.7%
34
 
5.3%
32
 
5.0%
28
 
4.4%
27
 
4.2%
24
 
3.8%
18
 
2.8%
13
 
2.0%
12
 
1.9%
Other values (160) 356
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 552
86.7%
Uppercase Letter 30
 
4.7%
Space Separator 24
 
3.8%
Open Punctuation 9
 
1.4%
Close Punctuation 9
 
1.4%
Lowercase Letter 5
 
0.8%
Decimal Number 4
 
0.6%
Other Punctuation 3
 
0.5%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
10.3%
36
 
6.5%
34
 
6.2%
32
 
5.8%
28
 
5.1%
27
 
4.9%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
Other values (131) 283
51.3%
Uppercase Letter
ValueCountFrequency (%)
B 4
13.3%
A 4
13.3%
E 4
13.3%
K 3
10.0%
O 3
10.0%
G 2
 
6.7%
R 2
 
6.7%
V 1
 
3.3%
Y 1
 
3.3%
P 1
 
3.3%
Other values (5) 5
16.7%
Lowercase Letter
ValueCountFrequency (%)
y 1
20.0%
c 1
20.0%
n 1
20.0%
e 1
20.0%
g 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 1
33.3%
· 1
33.3%
& 1
33.3%
Decimal Number
ValueCountFrequency (%)
0 3
75.0%
2 1
 
25.0%
Space Separator
ValueCountFrequency (%)
24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 552
86.7%
Common 50
 
7.8%
Latin 35
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
10.3%
36
 
6.5%
34
 
6.2%
32
 
5.8%
28
 
5.1%
27
 
4.9%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
Other values (131) 283
51.3%
Latin
ValueCountFrequency (%)
B 4
 
11.4%
A 4
 
11.4%
E 4
 
11.4%
K 3
 
8.6%
O 3
 
8.6%
G 2
 
5.7%
R 2
 
5.7%
V 1
 
2.9%
Y 1
 
2.9%
P 1
 
2.9%
Other values (10) 10
28.6%
Common
ValueCountFrequency (%)
24
48.0%
( 9
 
18.0%
) 9
 
18.0%
0 3
 
6.0%
2 1
 
2.0%
. 1
 
2.0%
· 1
 
2.0%
& 1
 
2.0%
- 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 552
86.7%
ASCII 84
 
13.2%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
10.3%
36
 
6.5%
34
 
6.2%
32
 
5.8%
28
 
5.1%
27
 
4.9%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
Other values (131) 283
51.3%
ASCII
ValueCountFrequency (%)
24
28.6%
( 9
 
10.7%
) 9
 
10.7%
B 4
 
4.8%
A 4
 
4.8%
E 4
 
4.8%
0 3
 
3.6%
K 3
 
3.6%
O 3
 
3.6%
G 2
 
2.4%
Other values (18) 19
22.6%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct90
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-13T05:22:41.105222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length42
Mean length30.89011
Min length9

Characters and Unicode

Total characters2811
Distinct characters144
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)97.8%

Sample

1st row경상남도 거제시 거제중앙로20길 15-1. 102호 (고현동. 일진주택)
2nd row경상남도 거제시 장평3로3길 18. 101호 (장평동)
3rd row경상남도 거제시 고현로 96. 2층 201호 (고현동)
4th row경상남도 거제시 옥포성안로 71. 2층 201호 (옥포동. 이던플레이스)
5th row경상남도 거제시 거제대로 4779-5. 거제조경공사 2층 201호 (고현동)
ValueCountFrequency (%)
경상남도 90
 
15.9%
거제시 90
 
15.9%
고현동 44
 
7.8%
옥포동 20
 
3.5%
2층 19
 
3.4%
1층 15
 
2.6%
아주동 9
 
1.6%
거제중앙로 9
 
1.6%
고현로 7
 
1.2%
장평동 6
 
1.1%
Other values (184) 258
45.5%
2023-12-13T05:22:41.513557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
476
 
16.9%
1 147
 
5.2%
120
 
4.3%
120
 
4.3%
97
 
3.5%
96
 
3.4%
93
 
3.3%
92
 
3.3%
90
 
3.2%
90
 
3.2%
Other values (134) 1390
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1578
56.1%
Space Separator 476
 
16.9%
Decimal Number 476
 
16.9%
Close Punctuation 86
 
3.1%
Open Punctuation 86
 
3.1%
Other Punctuation 82
 
2.9%
Dash Punctuation 25
 
0.9%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
7.6%
120
 
7.6%
97
 
6.1%
96
 
6.1%
93
 
5.9%
92
 
5.8%
90
 
5.7%
90
 
5.7%
86
 
5.4%
61
 
3.9%
Other values (117) 633
40.1%
Decimal Number
ValueCountFrequency (%)
1 147
30.9%
2 81
17.0%
0 54
 
11.3%
3 47
 
9.9%
7 29
 
6.1%
4 28
 
5.9%
5 27
 
5.7%
6 26
 
5.5%
8 20
 
4.2%
9 17
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
476
100.0%
Close Punctuation
ValueCountFrequency (%)
) 86
100.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Other Punctuation
ValueCountFrequency (%)
. 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1578
56.1%
Common 1231
43.8%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
7.6%
120
 
7.6%
97
 
6.1%
96
 
6.1%
93
 
5.9%
92
 
5.8%
90
 
5.7%
90
 
5.7%
86
 
5.4%
61
 
3.9%
Other values (117) 633
40.1%
Common
ValueCountFrequency (%)
476
38.7%
1 147
 
11.9%
) 86
 
7.0%
( 86
 
7.0%
. 82
 
6.7%
2 81
 
6.6%
0 54
 
4.4%
3 47
 
3.8%
7 29
 
2.4%
4 28
 
2.3%
Other values (5) 115
 
9.3%
Latin
ValueCountFrequency (%)
C 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1578
56.1%
ASCII 1233
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
476
38.6%
1 147
 
11.9%
) 86
 
7.0%
( 86
 
7.0%
. 82
 
6.7%
2 81
 
6.6%
0 54
 
4.4%
3 47
 
3.8%
7 29
 
2.4%
4 28
 
2.3%
Other values (7) 117
 
9.5%
Hangul
ValueCountFrequency (%)
120
 
7.6%
120
 
7.6%
97
 
6.1%
96
 
6.1%
93
 
5.9%
92
 
5.8%
90
 
5.7%
90
 
5.7%
86
 
5.4%
61
 
3.9%
Other values (117) 633
40.1%

전화번호
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing71
Missing (%)78.0%
Memory size860.0 B
2023-12-13T05:22:41.677776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.8
Min length8

Characters and Unicode

Total characters236
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row055-632-7005
2nd row055-633-3348
3rd row055-634-0801
4th row055-632-6662
5th row055-636-3018
ValueCountFrequency (%)
055-632-7005 1
 
5.0%
055-633-3348 1
 
5.0%
055-687-6022 1
 
5.0%
055-681-4788 1
 
5.0%
055-638-5005 1
 
5.0%
055-637-5536 1
 
5.0%
055-632-8405 1
 
5.0%
055-637-4333 1
 
5.0%
055-681-1804 1
 
5.0%
055-688-0565 1
 
5.0%
Other values (10) 10
50.0%
2023-12-13T05:22:41.949895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 46
19.5%
- 39
16.5%
0 33
14.0%
6 27
11.4%
8 25
10.6%
3 19
8.1%
4 13
 
5.5%
7 11
 
4.7%
2 9
 
3.8%
1 9
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 197
83.5%
Dash Punctuation 39
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 46
23.4%
0 33
16.8%
6 27
13.7%
8 25
12.7%
3 19
9.6%
4 13
 
6.6%
7 11
 
5.6%
2 9
 
4.6%
1 9
 
4.6%
9 5
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 236
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 46
19.5%
- 39
16.5%
0 33
14.0%
6 27
11.4%
8 25
10.6%
3 19
8.1%
4 13
 
5.5%
7 11
 
4.7%
2 9
 
3.8%
1 9
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 236
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 46
19.5%
- 39
16.5%
0 33
14.0%
6 27
11.4%
8 25
10.6%
3 19
8.1%
4 13
 
5.5%
7 11
 
4.7%
2 9
 
3.8%
1 9
 
3.8%

기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size860.0 B
Minimum2023-08-01 00:00:00
Maximum2023-08-01 00:00:00
2023-12-13T05:22:42.050614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:22:42.129006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T05:22:39.374601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:22:42.203470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분법인명사업소도로명주소전화번호
순번1.0000.0661.0001.0001.000
유무료구분0.0661.0001.0001.0001.000
법인명1.0001.0001.0001.0001.000
사업소도로명주소1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
2023-12-13T05:22:42.284248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번유무료구분
순번1.0000.035
유무료구분0.0351.000

Missing values

2023-12-13T05:22:39.499425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:22:39.613859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번유무료구분법인명사업소도로명주소전화번호기준일
01유료매일인력경상남도 거제시 거제중앙로20길 15-1. 102호 (고현동. 일진주택)<NA>2023-08-01
12유료대지인력경상남도 거제시 장평3로3길 18. 101호 (장평동)<NA>2023-08-01
23유료고현인력경상남도 거제시 고현로 96. 2층 201호 (고현동)<NA>2023-08-01
34유료주식회사 코러스인터네셔널경상남도 거제시 옥포성안로 71. 2층 201호 (옥포동. 이던플레이스)<NA>2023-08-01
45유료이피에이코비스코리아(주)경상남도 거제시 거제대로 4779-5. 거제조경공사 2층 201호 (고현동)<NA>2023-08-01
56유료주식회사 큐브경상남도 거제시 거제중앙로17길 10. 나인스카이시티 2층 203호 (고현동)<NA>2023-08-01
67유료현대인력경상남도 거제시 중곡로 3. 2층 (고현동)<NA>2023-08-01
78유료금아인터내셔널경상남도 거제시 아주1로2길 8. 스타타워 오피스텔 7층 703호 (아주동)<NA>2023-08-01
89유료이지오에스(EGOS)경상남도 거제시 덕포5길 29. 1층 (덕포동)<NA>2023-08-01
910유료(주)오션플랜트경상남도 거제시 고현천로 110. 5층 (고현동)<NA>2023-08-01
순번유무료구분법인명사업소도로명주소전화번호기준일
8182유료천하직업소개소경상남도 거제시 옥포로22길 49 (옥포동)<NA>2023-08-01
8283유료일일인력센타경상남도 거제시 고현로13길 16. 2층 (고현동)055-637-55362023-08-01
8384유료한일인력공사경상남도 거제시 고현로 121 (고현동)<NA>2023-08-01
8485유료하이디 직업소개소경상남도 거제시 서문로 54-2. 대성장여관 2층 (고현동)<NA>2023-08-01
8586유료우리가사원경상남도 거제시 중곡2로4길 30. 1층 (고현동)055-638-50052023-08-01
8687유료우리인력경상남도 거제시 탑곡로2길 18. 101호 (아주동)055-681-47882023-08-01
8788유료새거제 인력개발경상남도 거제시 옥포로 250. 옥현시장상가 122-1호 (옥포동)055-687-60222023-08-01
8889유료아주인력경상남도 거제시 아주3길 5-2 (아주동.2동)<NA>2023-08-01
8990유료터미널인력경상남도 거제시 고현로11길 3 (고현동)<NA>2023-08-01
9091유료대성유료직업소개소경상남도 거제시 옥포대첩로3길 27 (옥포동)687-47402023-08-01