Overview

Dataset statistics

Number of variables6
Number of observations123
Missing cells327
Missing cells (%)44.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory51.1 B

Variable types

Categorical1
Text3
Unsupported2

Dataset

Description사이트구분,기관명,사이트주소,주소_연락처,등록일시,수정일시
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21058/S/1/datasetView.do

Alerts

주소_연락처 has 81 (65.9%) missing valuesMissing
등록일시 has 123 (100.0%) missing valuesMissing
수정일시 has 123 (100.0%) missing valuesMissing
기관명 has unique valuesUnique
사이트주소 has unique valuesUnique
등록일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported
수정일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-03 22:21:48.845727
Analysis finished2024-05-03 22:21:50.173230
Duration1.33 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사이트구분
Categorical

Distinct10
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
자치구 취업정보
25 
여성관련사이트
23 
공무원사이트
19 
전문사이트
17 
취업포털
10 
Other values (5)
29 

Length

Max length12
Median length8
Mean length6.902439
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row취업포털
2nd row취업포털
3rd row취업포털
4th row취업포털
5th row취업포털

Common Values

ValueCountFrequency (%)
자치구 취업정보 25
20.3%
여성관련사이트 23
18.7%
공무원사이트 19
15.4%
전문사이트 17
13.8%
취업포털 10
 
8.1%
고령자관련사이트 7
 
5.7%
해외취업사이트 7
 
5.7%
직업훈련 및 자격사이트 6
 
4.9%
외국계기업사이트 5
 
4.1%
장애인관련사이트 4
 
3.3%

Length

2024-05-03T22:21:50.436738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-03T22:21:50.861796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자치구 25
15.6%
취업정보 25
15.6%
여성관련사이트 23
14.4%
공무원사이트 19
11.9%
전문사이트 17
10.6%
취업포털 10
 
6.2%
고령자관련사이트 7
 
4.4%
해외취업사이트 7
 
4.4%
직업훈련 6
 
3.8%
6
 
3.8%
Other values (3) 15
9.4%

기관명
Text

UNIQUE 

Distinct123
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-05-03T22:21:51.706641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length14
Mean length8.4308943
Min length2

Characters and Unicode

Total characters1037
Distinct characters192
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)100.0%

Sample

1st row워크넷
2nd row제대군인 지원센터
3rd row커리어
4th row잡코리아
5th row인크루트
ValueCountFrequency (%)
일자리플러스센터 18
 
10.8%
전문 3
 
1.8%
교육청 3
 
1.8%
일자리지원센터 2
 
1.2%
일자리센터 2
 
1.2%
서울시 2
 
1.2%
워크넷 1
 
0.6%
한국철도공사 1
 
0.6%
경찰청 1
 
0.6%
서울지방경찰청 1
 
0.6%
Other values (132) 132
79.5%
2024-05-03T22:21:53.099896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
5.2%
54
 
5.2%
43
 
4.1%
40
 
3.9%
30
 
2.9%
29
 
2.8%
27
 
2.6%
26
 
2.5%
26
 
2.5%
26
 
2.5%
Other values (182) 682
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 985
95.0%
Space Separator 43
 
4.1%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Uppercase Letter 2
 
0.2%
Decimal Number 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
5.5%
54
 
5.5%
40
 
4.1%
30
 
3.0%
29
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
26
 
2.6%
24
 
2.4%
Other values (172) 649
65.9%
Close Punctuation
ValueCountFrequency (%)
] 1
50.0%
) 1
50.0%
Open Punctuation
ValueCountFrequency (%)
[ 1
50.0%
( 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
50.0%
P 1
50.0%
Decimal Number
ValueCountFrequency (%)
5 1
50.0%
0 1
50.0%
Space Separator
ValueCountFrequency (%)
43
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 985
95.0%
Common 50
 
4.8%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
5.5%
54
 
5.5%
40
 
4.1%
30
 
3.0%
29
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
26
 
2.6%
24
 
2.4%
Other values (172) 649
65.9%
Common
ValueCountFrequency (%)
43
86.0%
] 1
 
2.0%
[ 1
 
2.0%
5 1
 
2.0%
& 1
 
2.0%
0 1
 
2.0%
) 1
 
2.0%
( 1
 
2.0%
Latin
ValueCountFrequency (%)
U 1
50.0%
P 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 985
95.0%
ASCII 52
 
5.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
5.5%
54
 
5.5%
40
 
4.1%
30
 
3.0%
29
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
26
 
2.6%
24
 
2.4%
Other values (172) 649
65.9%
ASCII
ValueCountFrequency (%)
43
82.7%
] 1
 
1.9%
[ 1
 
1.9%
U 1
 
1.9%
5 1
 
1.9%
& 1
 
1.9%
0 1
 
1.9%
) 1
 
1.9%
( 1
 
1.9%
P 1
 
1.9%

사이트주소
Text

UNIQUE 

Distinct123
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-05-03T22:21:53.751618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length32
Mean length26.471545
Min length18

Characters and Unicode

Total characters3256
Distinct characters35
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)100.0%

Sample

1st rowhttp://www.work.go.kr
2nd rowhttp://www.vnet.go.kr
3rd rowhttp://www.career.co.kr
4th rowhttp://www.jobkorea.co.kr
5th rowhttp://www.incruit.co.kr
ValueCountFrequency (%)
http://www.work.go.kr 1
 
0.8%
http://www.kordi.or.kr 1
 
0.8%
http://www.dbedu.or.kr 1
 
0.8%
http://www.jbedu.or.kr 1
 
0.8%
http://www.bukedu.or.kr 1
 
0.8%
http://www.korail.go.kr 1
 
0.8%
http://www.nfsa.go.kr 1
 
0.8%
http://fire.seoul.go.kr 1
 
0.8%
http://www.smpa.go.kr 1
 
0.8%
http://www.police.go.kr 1
 
0.8%
Other values (113) 113
91.9%
2024-05-03T22:21:54.934835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 356
 
10.9%
/ 320
 
9.8%
o 301
 
9.2%
t 270
 
8.3%
w 251
 
7.7%
r 203
 
6.2%
p 159
 
4.9%
k 150
 
4.6%
h 137
 
4.2%
: 123
 
3.8%
Other values (25) 986
30.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2446
75.1%
Other Punctuation 799
 
24.5%
Decimal Number 9
 
0.3%
Dash Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 301
12.3%
t 270
11.0%
w 251
 
10.3%
r 203
 
8.3%
p 159
 
6.5%
k 150
 
6.1%
h 137
 
5.6%
e 121
 
4.9%
s 119
 
4.9%
g 103
 
4.2%
Other values (15) 632
25.8%
Decimal Number
ValueCountFrequency (%)
0 3
33.3%
6 2
22.2%
5 2
22.2%
3 1
 
11.1%
2 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
. 356
44.6%
/ 320
40.1%
: 123
 
15.4%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2447
75.2%
Common 809
 
24.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 301
12.3%
t 270
11.0%
w 251
 
10.3%
r 203
 
8.3%
p 159
 
6.5%
k 150
 
6.1%
h 137
 
5.6%
e 121
 
4.9%
s 119
 
4.9%
g 103
 
4.2%
Other values (16) 633
25.9%
Common
ValueCountFrequency (%)
. 356
44.0%
/ 320
39.6%
: 123
 
15.2%
0 3
 
0.4%
6 2
 
0.2%
5 2
 
0.2%
3 1
 
0.1%
2 1
 
0.1%
- 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3256
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 356
 
10.9%
/ 320
 
9.8%
o 301
 
9.2%
t 270
 
8.3%
w 251
 
7.7%
r 203
 
6.2%
p 159
 
4.9%
k 150
 
4.6%
h 137
 
4.2%
: 123
 
3.8%
Other values (25) 986
30.3%

주소_연락처
Text

MISSING 

Distinct42
Distinct (%)100.0%
Missing81
Missing (%)65.9%
Memory size1.1 KiB
2024-05-03T22:21:55.573483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length57.5
Mean length48.785714
Min length28

Characters and Unicode

Total characters2049
Distinct characters162
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row서울 종로구 삼봉로 43(수송동 146-2) 종로구청 본관 1층(민원봉사실과 별도 공간) 02)2148-3958
2nd row서울 중구 창경궁로 17(예관동) 중구청 별관1층(교통민원실 내) 02)3396-5695
3rd row서울 용산구 녹사평대로 150(이태원동) 용산구청 5층(고용정책과 사무실 내) 02)2199-7214~6
4th row서울 성동구 고산자로 270 성동구청 본관 1층(민원실과 우리은행 사이 별도 공간) 02)2286-5408~10
5th row서울 광진구 자양로 117(자양동) 광진구청 제3별관 2층 02)450-1419
ValueCountFrequency (%)
서울 42
 
12.1%
본관 17
 
4.9%
별도 11
 
3.2%
공간 11
 
3.2%
7
 
2.0%
1층(민원봉사실과 5
 
1.4%
2층(민원봉사실과 4
 
1.2%
3층 3
 
0.9%
노원구 2
 
0.6%
1층(민원봉사실내 2
 
0.6%
Other values (222) 242
69.9%
2024-05-03T22:21:56.821838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
304
 
14.8%
2 123
 
6.0%
1 102
 
5.0%
0 89
 
4.3%
) 89
 
4.3%
72
 
3.5%
5 61
 
3.0%
- 59
 
2.9%
55
 
2.7%
53
 
2.6%
Other values (152) 1042
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 891
43.5%
Decimal Number 637
31.1%
Space Separator 304
 
14.8%
Close Punctuation 89
 
4.3%
Dash Punctuation 59
 
2.9%
Open Punctuation 47
 
2.3%
Math Symbol 19
 
0.9%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
8.1%
55
 
6.2%
53
 
5.9%
43
 
4.8%
39
 
4.4%
37
 
4.2%
27
 
3.0%
25
 
2.8%
23
 
2.6%
22
 
2.5%
Other values (135) 495
55.6%
Decimal Number
ValueCountFrequency (%)
2 123
19.3%
1 102
16.0%
0 89
14.0%
5 61
9.6%
4 52
8.2%
3 51
8.0%
9 47
 
7.4%
6 45
 
7.1%
7 34
 
5.3%
8 33
 
5.2%
Other Punctuation
ValueCountFrequency (%)
/ 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
304
100.0%
Close Punctuation
ValueCountFrequency (%)
) 89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Math Symbol
ValueCountFrequency (%)
~ 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1158
56.5%
Hangul 891
43.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
8.1%
55
 
6.2%
53
 
5.9%
43
 
4.8%
39
 
4.4%
37
 
4.2%
27
 
3.0%
25
 
2.8%
23
 
2.6%
22
 
2.5%
Other values (135) 495
55.6%
Common
ValueCountFrequency (%)
304
26.3%
2 123
10.6%
1 102
 
8.8%
0 89
 
7.7%
) 89
 
7.7%
5 61
 
5.3%
- 59
 
5.1%
4 52
 
4.5%
3 51
 
4.4%
9 47
 
4.1%
Other values (7) 181
15.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1158
56.5%
Hangul 891
43.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
304
26.3%
2 123
10.6%
1 102
 
8.8%
0 89
 
7.7%
) 89
 
7.7%
5 61
 
5.3%
- 59
 
5.1%
4 52
 
4.5%
3 51
 
4.4%
9 47
 
4.1%
Other values (7) 181
15.6%
Hangul
ValueCountFrequency (%)
72
 
8.1%
55
 
6.2%
53
 
5.9%
43
 
4.8%
39
 
4.4%
37
 
4.2%
27
 
3.0%
25
 
2.8%
23
 
2.6%
22
 
2.5%
Other values (135) 495
55.6%

등록일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing123
Missing (%)100.0%
Memory size1.2 KiB

수정일시
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing123
Missing (%)100.0%
Memory size1.2 KiB

Correlations

2024-05-03T22:21:57.094382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사이트구분주소_연락처
사이트구분1.0001.000
주소_연락처1.0001.000

Missing values

2024-05-03T22:21:49.625016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-03T22:21:50.030326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사이트구분기관명사이트주소주소_연락처등록일시수정일시
0취업포털워크넷http://www.work.go.kr<NA><NA><NA>
1취업포털제대군인 지원센터http://www.vnet.go.kr<NA><NA><NA>
2취업포털커리어http://www.career.co.kr<NA><NA><NA>
3취업포털잡코리아http://www.jobkorea.co.kr<NA><NA><NA>
4취업포털인크루트http://www.incruit.co.kr<NA><NA><NA>
5취업포털사람인http://www.saramin.co.kr<NA><NA><NA>
6취업포털스카우트http://scout.co.kr<NA><NA><NA>
7취업포털파인드잡http://findjob.co.kr<NA><NA><NA>
8취업포털에듀스http://www.educe.co.kr<NA><NA><NA>
9취업포털경찰전직지원센터http://www.polsenior.co.kr<NA><NA><NA>
사이트구분기관명사이트주소주소_연락처등록일시수정일시
113전문사이트재경분야 전문http://www.accountingpeople.co.kr<NA><NA><NA>
114전문사이트외식업 전문http://www.foodwork.co.kr<NA><NA><NA>
115전문사이트잡쿡http://www.cookfindjob.co.kr<NA><NA><NA>
116전문사이트운전인 전문http://www.jobcar.co.kr<NA><NA><NA>
117전문사이트섬유 패션 취업포털http://www.fashionwork.co.kr<NA><NA><NA>
118전문사이트의료계 전문사이트http://www.medicaljob.co.kr<NA><NA><NA>
119전문사이트석박사 네트워크http://www.hibrain.net<NA><NA><NA>
120전문사이트관광전문인력포털[관광인]http://academy.visitkorea.or.kr<NA><NA><NA>
121전문사이트건설일드림넷http://www.cid.or.kr<NA><NA><NA>
122전문사이트전시UP 전시산업 구인구직사이트http://www.expoup.or.kr/<NA><NA><NA>