Overview

Dataset statistics

Number of variables5
Number of observations50
Missing cells8
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory42.6 B

Variable types

Categorical2
Text3

Dataset

Description매년 공표되는 근로자 건강증진활동 관련 민간 전문기관 현황 직업건강증진팀에서 게시한 2023년도 「근로자 건강증진활동」 관련 민간 전문기관 현황입니다.
URLhttps://www.data.go.kr/data/15068768/fileData.do

Alerts

전문분야 is highly overall correlated with 기관명High correlation
기관명 is highly overall correlated with 전문분야High correlation
전문분야 is highly imbalanced (60.2%)Imbalance
지사명 has 8 (16.0%) missing valuesMissing
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:58:50.443687
Analysis finished2023-12-12 04:58:51.029226
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
(사)한국직업건강간호협회
22 
대한산업보건협회
20 
㈜다인
 
1
마음의 숲
 
1
(재)숲과나눔 일환경건강센터
 
1
Other values (5)

Length

Max length15
Median length13
Mean length10.26
Min length3

Unique

Unique8 ?
Unique (%)16.0%

Sample

1st row㈜다인
2nd row대한산업보건협회
3rd row대한산업보건협회
4th row대한산업보건협회
5th row대한산업보건협회

Common Values

ValueCountFrequency (%)
(사)한국직업건강간호협회 22
44.0%
대한산업보건협회 20
40.0%
㈜다인 1
 
2.0%
마음의 숲 1
 
2.0%
(재)숲과나눔 일환경건강센터 1
 
2.0%
주식회사터직업환경의학센터 1
 
2.0%
한국금연운동협의회 1
 
2.0%
한국산업안전원 1
 
2.0%
한국산업위생협회 1
 
2.0%
한국EAP협회 1
 
2.0%

Length

2023-12-12T13:58:51.138409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:58:51.307084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사)한국직업건강간호협회 22
42.3%
대한산업보건협회 20
38.5%
㈜다인 1
 
1.9%
마음의 1
 
1.9%
1
 
1.9%
재)숲과나눔 1
 
1.9%
일환경건강센터 1
 
1.9%
주식회사터직업환경의학센터 1
 
1.9%
한국금연운동협의회 1
 
1.9%
한국산업안전원 1
 
1.9%
Other values (2) 2
 
3.8%

전문분야
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리
42 
직무스트레스관리
 
3
뇌심혈관질환예방 근골격계질환예방 생활습관개선 직무스트레스관리
 
3
뇌심혈관질환예방 근골격계질환예방 직무스트레스관리 생활습관개선
 
1
금연프로그램
 
1

Length

Max length40
Median length40
Mean length36.84
Min length6

Unique

Unique2 ?
Unique (%)4.0%

Sample

1st row직무스트레스관리
2nd row뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리
3rd row뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리
4th row뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리
5th row뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리

Common Values

ValueCountFrequency (%)
뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리 42
84.0%
직무스트레스관리 3
 
6.0%
뇌심혈관질환예방 근골격계질환예방 생활습관개선 직무스트레스관리 3
 
6.0%
뇌심혈관질환예방 근골격계질환예방 직무스트레스관리 생활습관개선 1
 
2.0%
금연프로그램 1
 
2.0%

Length

2023-12-12T13:58:51.480388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:58:51.621811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
직무스트레스관리 49
21.3%
뇌심혈관질환예방 46
20.0%
근골격계질환예방 46
20.0%
생활습관개선(금연 42
18.3%
절주 42
18.3%
생활습관개선 4
 
1.7%
금연프로그램 1
 
0.4%

지사명
Text

MISSING 

Distinct42
Distinct (%)100.0%
Missing8
Missing (%)16.0%
Memory size532.0 B
2023-12-12T13:58:51.893402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8
Mean length8.7619048
Min length6

Characters and Unicode

Total characters368
Distinct characters42
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row서울지역본부
2nd row경인지역본부
3rd row경기북부산업보건센터
4th row경기서부산업보건센터
5th row강원산업보건센터
ValueCountFrequency (%)
강원산업보건센터 2
 
4.4%
광주전남북지역본부 1
 
2.2%
서울보건안전센터 1
 
2.2%
부천보건안전센터 1
 
2.2%
경기동부보건안전센터 1
 
2.2%
경기서부보건안전센터 1
 
2.2%
경기남부보건안전센터 1
 
2.2%
경기북부보건안전센터 1
 
2.2%
대전보건안전센터 1
 
2.2%
충남보건안전센터 1
 
2.2%
Other values (34) 34
75.6%
2023-12-12T13:58:52.435640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
 
9.8%
35
 
9.5%
35
 
9.5%
35
 
9.5%
28
 
7.6%
22
 
6.0%
21
 
5.7%
20
 
5.4%
15
 
4.1%
12
 
3.3%
Other values (32) 109
29.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 365
99.2%
Space Separator 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
9.9%
35
 
9.6%
35
 
9.6%
35
 
9.6%
28
 
7.7%
22
 
6.0%
21
 
5.8%
20
 
5.5%
15
 
4.1%
12
 
3.3%
Other values (31) 106
29.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 365
99.2%
Common 3
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
9.9%
35
 
9.6%
35
 
9.6%
35
 
9.6%
28
 
7.7%
22
 
6.0%
21
 
5.8%
20
 
5.5%
15
 
4.1%
12
 
3.3%
Other values (31) 106
29.0%
Common
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 365
99.2%
ASCII 3
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
36
 
9.9%
35
 
9.6%
35
 
9.6%
35
 
9.6%
28
 
7.7%
22
 
6.0%
21
 
5.8%
20
 
5.5%
15
 
4.1%
12
 
3.3%
Other values (31) 106
29.0%
ASCII
ValueCountFrequency (%)
3
100.0%
Distinct49
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T13:58:52.766347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length35.5
Mean length28.6
Min length16

Characters and Unicode

Total characters1430
Distinct characters194
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)96.0%

Sample

1st row서울시 강남구 신사동 도산대로 149 진우빌딩 5층
2nd row서울시 금천구 디지털로9길 56 코오롱테크노벨리 2층
3rd row경기도 수원시 팔달구 인계로 126 미디어시티 5층
4th row경기도 의정부시 범골로 142
5th row경기도 안산시 단원구 신길로1길 86 신우프라자 2~5층
ValueCountFrequency (%)
2층 8
 
2.9%
경기도 6
 
2.2%
서울시 4
 
1.5%
청주시 4
 
1.5%
대구광역시 3
 
1.1%
서울 3
 
1.1%
전라북도 3
 
1.1%
창원시 3
 
1.1%
3층 3
 
1.1%
흥덕구 3
 
1.1%
Other values (205) 234
85.4%
2023-12-12T13:58:53.236561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
226
 
15.8%
1 66
 
4.6%
2 58
 
4.1%
51
 
3.6%
47
 
3.3%
40
 
2.8%
4 38
 
2.7%
0 35
 
2.4%
3 34
 
2.4%
27
 
1.9%
Other values (184) 808
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 815
57.0%
Decimal Number 314
 
22.0%
Space Separator 226
 
15.8%
Open Punctuation 27
 
1.9%
Close Punctuation 27
 
1.9%
Dash Punctuation 17
 
1.2%
Math Symbol 2
 
0.1%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
6.3%
47
 
5.8%
40
 
4.9%
27
 
3.3%
25
 
3.1%
23
 
2.8%
23
 
2.8%
20
 
2.5%
19
 
2.3%
18
 
2.2%
Other values (167) 522
64.0%
Decimal Number
ValueCountFrequency (%)
1 66
21.0%
2 58
18.5%
4 38
12.1%
0 35
11.1%
3 34
10.8%
6 21
 
6.7%
5 20
 
6.4%
7 16
 
5.1%
8 14
 
4.5%
9 12
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
226
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 815
57.0%
Common 613
42.9%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
6.3%
47
 
5.8%
40
 
4.9%
27
 
3.3%
25
 
3.1%
23
 
2.8%
23
 
2.8%
20
 
2.5%
19
 
2.3%
18
 
2.2%
Other values (167) 522
64.0%
Common
ValueCountFrequency (%)
226
36.9%
1 66
 
10.8%
2 58
 
9.5%
4 38
 
6.2%
0 35
 
5.7%
3 34
 
5.5%
( 27
 
4.4%
) 27
 
4.4%
6 21
 
3.4%
5 20
 
3.3%
Other values (5) 61
 
10.0%
Latin
ValueCountFrequency (%)
S 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 815
57.0%
ASCII 615
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
226
36.7%
1 66
 
10.7%
2 58
 
9.4%
4 38
 
6.2%
0 35
 
5.7%
3 34
 
5.5%
( 27
 
4.4%
) 27
 
4.4%
6 21
 
3.4%
5 20
 
3.3%
Other values (7) 63
 
10.2%
Hangul
ValueCountFrequency (%)
51
 
6.3%
47
 
5.8%
40
 
4.9%
27
 
3.3%
25
 
3.1%
23
 
2.8%
23
 
2.8%
20
 
2.5%
19
 
2.3%
18
 
2.2%
Other values (167) 522
64.0%

전화번호
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T13:58:53.852502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length12.12
Min length9

Characters and Unicode

Total characters606
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row02-2268-5980
2nd row02-866-9507
3rd row031-267-4400
4th row031-828-3800
5th row031-498-1063
ValueCountFrequency (%)
02-2268-5980 1
 
2.0%
053-744-5412 1
 
2.0%
02-782-3380 1
 
2.0%
02-701-9036 1
 
2.0%
032-422-1084 1
 
2.0%
031-756-0274 1
 
2.0%
031-485-0090 1
 
2.0%
031-223-5447 1
 
2.0%
031-876-4273 1
 
2.0%
042-582-9052 1
 
2.0%
Other values (40) 40
80.0%
2023-12-12T13:58:54.276059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 101
16.7%
0 88
14.5%
2 77
12.7%
3 57
9.4%
1 54
8.9%
5 48
7.9%
6 44
7.3%
4 39
 
6.4%
8 33
 
5.4%
7 33
 
5.4%
Other values (3) 32
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 500
82.5%
Dash Punctuation 101
 
16.7%
Math Symbol 3
 
0.5%
Space Separator 2
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 88
17.6%
2 77
15.4%
3 57
11.4%
1 54
10.8%
5 48
9.6%
6 44
8.8%
4 39
7.8%
8 33
 
6.6%
7 33
 
6.6%
9 27
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 606
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 101
16.7%
0 88
14.5%
2 77
12.7%
3 57
9.4%
1 54
8.9%
5 48
7.9%
6 44
7.3%
4 39
 
6.4%
8 33
 
5.4%
7 33
 
5.4%
Other values (3) 32
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 606
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 101
16.7%
0 88
14.5%
2 77
12.7%
3 57
9.4%
1 54
8.9%
5 48
7.9%
6 44
7.3%
4 39
 
6.4%
8 33
 
5.4%
7 33
 
5.4%
Other values (3) 32
 
5.3%

Correlations

2023-12-12T13:58:54.385277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명전문분야지사명소재지전화번호
기관명1.0001.0001.0001.0001.000
전문분야1.0001.000NaN1.0001.000
지사명1.000NaN1.0001.0001.000
소재지1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
2023-12-12T13:58:54.491337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전문분야기관명
전문분야1.0000.943
기관명0.9431.000
2023-12-12T13:58:54.584340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명전문분야
기관명1.0000.943
전문분야0.9431.000

Missing values

2023-12-12T13:58:50.836633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:58:50.973013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명전문분야지사명소재지전화번호
0㈜다인직무스트레스관리<NA>서울시 강남구 신사동 도산대로 149 진우빌딩 5층02-2268-5980
1대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리서울지역본부서울시 금천구 디지털로9길 56 코오롱테크노벨리 2층02-866-9507
2대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리경인지역본부경기도 수원시 팔달구 인계로 126 미디어시티 5층031-267-4400
3대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리경기북부산업보건센터경기도 의정부시 범골로 142031-828-3800
4대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리경기서부산업보건센터경기도 안산시 단원구 신길로1길 86 신우프라자 2~5층031-498-1063
5대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리강원산업보건센터강원도 춘천시 중앙로 172 7층 8층(근화동)033-254-4632
6대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리강원산업보건센터 원주사무소강원도 원주시 시청로 21-1 401호031-731-4725
7대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리인천산업보건센터인천광역시 미추홀구 경원대로 812 2층032-430-9300
8대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리대전충남지역본부대전광역시 대덕구 대덕대로 1403042-933-3200
9대한산업보건협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리천안산업보건센터충청남도 아산시 배방읍 희망로46번길 46-8 3층041-536-6900
기관명전문분야지사명소재지전화번호
40(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리광주보건안전센터광주광역시 서구 화운로199번길7(화정동95-1) 2층남서쪽 사무실062-972-2021
41(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리제주보건안전센터제주시 신대로 22길 25(연동 1373-1) 아일랜드마이빌 201호064-711-7823
42(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리전남동부보건안전센터전남 여수시 무선6길24(선원동1233-12) 1층061-681-0676
43(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리경남동부보건안전센터경남 양산시 동면 금오로247(석산리) 402호055-389-1412
44(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리울산보건안전센터울산광역시 남구 대학로128(무거동466-2) 하늘빌딩 3층052-277-8624
45(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리경남보건안전센터경상남도 창원시 마산합포구 해안대로343(남성동247-8) 8층055-221-0763
46(사)한국직업건강간호협회뇌심혈관질환예방 근골격계질환예방 생활습관개선(금연 절주) 직무스트레스관리전남보건안전센터전라남도 무안군 삼향읍 후광대로282(남악리2113) 11층(1104호)061-285-7256
47한국산업안전원뇌심혈관질환예방 근골격계질환예방 생활습관개선 직무스트레스관리<NA>충북 청주시 청원구 오창읍 중심상업로 31-4 605호(엔젤오메가빌딩)043-211-1909
48한국산업위생협회뇌심혈관질환예방 근골격계질환예방 생활습관개선 직무스트레스관리<NA>경기 부천시 소사구 경인로 124 송산빌딩 6층02-782-3380
49한국EAP협회직무스트레스관리<NA>서울 중구 수표로 45 1201호1566-5228