Overview

Dataset statistics

Number of variables: 6
Number of observations: 31
Missing cells: 31
Missing cells (%): 16.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 54.3 B

Variable types

Numeric: 1
Text: 3
Categorical: 1
Unsupported: 1

Dataset

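Description: Data on the status of institutions commissioned to deliver occupational safety and health job training, provided by the Korea Occupational Safety and Health Agency (한국산업안전보건공단)
Author: Korea Occupational Safety and Health Agency (한국산업안전보건공단)
URL: https://www.data.go.kr/data/15065563/fileData.do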

Alerts

Unnamed: 5 has 31 (100.0%) missing values [Missing]
연번 has unique values [Unique]
기관명 has unique values [Unique]
전화번호 has unique values [Unique]
주소명 has unique values [Unique]
Unnamed: 5 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
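
A column named Unnamed: 5 that is entirely missing is often the result of a trailing delimiter on every line of the source file, although that cause is an assumption here and is not confirmed by the report. The following minimal sketch, using only the Python standard library, shows one way to drop such all-empty columns before profiling. The two data rows are copied from the sample at the end of this report; the trailing commas and the helper name `drop_empty_columns` are hypothetical.

```python
import csv
import io

# Two rows taken from the report's sample. The trailing comma on each line is
# an ASSUMPTION used to reproduce an empty, unnamed sixth column.
RAW = (
    "연번,기관명,전화번호,주소명,지정관서,\n"
    "1,(사)대한산업보건협회대구경북지역본부,053-592-4901,"
    "대구 달서구 성서공단로15길 40 (호림동) (19-15번지),대 구 청,\n"
    "2,(사)대한산업보건협회부산경남지역본부,051-508-6088,"
    "부산 금정구 중앙대로 2139 (청룡동) (80-4),부 산 청,\n"
)


def drop_empty_columns(text):
    """Parse CSV text and drop every column whose data cells are all empty."""
    rows = list(csv.reader(io.StringIO(text)))
    header, body = rows[0], rows[1:]
    # Keep a column only if at least one data row has a non-blank value in it.
    keep = [
        i for i in range(len(header))
        if any(row[i].strip() for row in body)
    ]
    cleaned_header = [header[i] for i in keep]
    cleaned_body = [[row[i] for i in keep] for row in body]
    return cleaned_header, cleaned_body


header, body = drop_empty_columns(RAW)
print(header)
```

Dropping the column before profiling would also remove the 31 missing cells that account for the whole of the 16.7% missing-cell figure (31 of 31 × 6 = 186 cells).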

Reproduction

Analysis started: 2023-12-12 01:47:36.958641
Analysis finished: 2023-12-12 01:47:37.615361
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 411.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.5
Q1: 8.5
Median: 16
Q3: 23.5
95-th percentile: 29.5
Maximum: 31
Range: 30
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 9.0921211
Coefficient of variation (CV): 0.56825757
Kurtosis: -1.2
Mean: 16
Median Absolute Deviation (MAD): 8
Skewness: 0
Sum: 496
Variance: 82.666667
Monotonicity: Strictly increasing
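
Because 연번 is simply the integers 1 through 31, the figures above can be cross-checked with the Python standard library. The sketch below assumes the sample variance (n - 1 denominator) and linearly interpolated quantiles; those conventions are consistent with the reported values but are not taken from the profiler's source code.

```python
import statistics

values = list(range(1, 32))  # 연번 runs from 1 to 31 without gaps

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

# Quartiles with linear interpolation over the closed range of the data.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, round(variance, 6), round(std, 7), round(cv, 8))
print(q1, median, q3, iqr, mad)
```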
Histogram with fixed size bins (bins=31)
Common values

Value | Count | Frequency (%)
1 | 1 | 3.2%
2 | 1 | 3.2%
31 | 1 | 3.2%
30 | 1 | 3.2%
29 | 1 | 3.2%
28 | 1 | 3.2%
27 | 1 | 3.2%
26 | 1 | 3.2%
25 | 1 | 3.2%
24 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 3.2%
2 | 1 | 3.2%
3 | 1 | 3.2%
4 | 1 | 3.2%
5 | 1 | 3.2%
6 | 1 | 3.2%
7 | 1 | 3.2%
8 | 1 | 3.2%
9 | 1 | 3.2%
10 | 1 | 3.2%

Maximum 10 values

Value | Count | Frequency (%)
31 | 1 | 3.2%
30 | 1 | 3.2%
29 | 1 | 3.2%
28 | 1 | 3.2%
27 | 1 | 3.2%
26 | 1 | 3.2%
25 | 1 | 3.2%
24 | 1 | 3.2%
23 | 1 | 3.2%
22 | 1 | 3.2%

기관명 (institution name)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 21
Median length: 17
Mean length: 13.258065
Min length: 6

Characters and Unicode

Total characters: 411
Distinct characters: 72
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: (사)대한산업보건협회대구경북지역본부
2nd row: (사)대한산업보건협회부산경남지역본부
3rd row: 사단법인대한산업보건협회
4th row: (사)대한산업안전협회
5th row: (사)대한산업안전협회광주지역본부
Value | Count | Frequency (%)
사)대한산업보건협회대구경북지역본부 | 1 | 3.2%
건설기술교육원 | 1 | 3.2%
사단법인한국화학안전협회 | 1 | 3.2%
한국해양수산연수원 | 1 | 3.2%
한국종합안전교육사회적협동조합 | 1 | 3.2%
사단법인환경안전보건협회 | 1 | 3.2%
사단법인환경안전기술원 | 1 | 3.2%
사단법인한국크레인협회 | 1 | 3.2%
사단법인한국안전보건협회 | 1 | 3.2%
사단법인한국안전기술협회 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Most occurring characters

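Value | Count | Frequency (%)
[Hangul letter] | 28 | 6.8%
[Hangul letter] | 27 | 6.6%
[Hangul letter] | 26 | 6.3%
[Hangul letter] | 22 | 5.4%
[Hangul letter] | 22 | 5.4%
[Hangul letter] | 21 | 5.1%
[Hangul letter] | 18 | 4.4%
[Hangul letter] | 16 | 3.9%
[Hangul letter] | 15 | 3.6%
[Hangul letter] | 15 | 3.6%
Other values (62) | 201 | 48.9%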

Most occurring categories

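Value | Count | Frequency (%)
Other Letter | 389 | 94.6%
Open Punctuation | 11 | 2.7%
Close Punctuation | 11 | 2.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[Hangul letter] | 28 | 7.2%
[Hangul letter] | 27 | 6.9%
[Hangul letter] | 26 | 6.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 21 | 5.4%
[Hangul letter] | 18 | 4.6%
[Hangul letter] | 16 | 4.1%
[Hangul letter] | 15 | 3.9%
[Hangul letter] | 15 | 3.9%
Other values (60) | 179 | 46.0%

Open Punctuation
Value | Count | Frequency (%)
( | 11 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 11 | 100.0%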

Most occurring scripts

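Value | Count | Frequency (%)
Hangul | 389 | 94.6%
Common | 22 | 5.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[Hangul letter] | 28 | 7.2%
[Hangul letter] | 27 | 6.9%
[Hangul letter] | 26 | 6.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 21 | 5.4%
[Hangul letter] | 18 | 4.6%
[Hangul letter] | 16 | 4.1%
[Hangul letter] | 15 | 3.9%
[Hangul letter] | 15 | 3.9%
Other values (60) | 179 | 46.0%

Common
Value | Count | Frequency (%)
( | 11 | 50.0%
) | 11 | 50.0%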

Most occurring blocks

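Value | Count | Frequency (%)
Hangul | 389 | 94.6%
ASCII | 22 | 5.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[Hangul letter] | 28 | 7.2%
[Hangul letter] | 27 | 6.9%
[Hangul letter] | 26 | 6.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 22 | 5.7%
[Hangul letter] | 21 | 5.4%
[Hangul letter] | 18 | 4.6%
[Hangul letter] | 16 | 4.1%
[Hangul letter] | 15 | 3.9%
[Hangul letter] | 15 | 3.9%
Other values (60) | 179 | 46.0%

ASCII
Value | Count | Frequency (%)
( | 11 | 50.0%
) | 11 | 50.0%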

전화번호 (phone number)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.709677
Min length: 9

Characters and Unicode

Total characters: 363
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 053-592-4901
2nd row: 051-508-6088
3rd row: 02-2046-0511
4th row: 02-860-7000
5th row: 062-943-0156
Value | Count | Frequency (%)
053-592-4901 | 1 | 3.2%
032-460-0243 | 1 | 3.2%
031-410-2992 | 1 | 3.2%
051-620-5411 | 1 | 3.2%
1588-2907 | 1 | 3.2%
02-3471-7534 | 1 | 3.2%
032-715-8134 | 1 | 3.2%
02-522-1507 | 1 | 3.2%
02-3666-9777 | 1 | 3.2%
031-431-3122 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 66 | 18.2%
- | 60 | 16.5%
1 | 37 | 10.2%
2 | 35 | 9.6%
5 | 33 | 9.1%
4 | 28 | 7.7%
3 | 26 | 7.2%
7 | 23 | 6.3%
6 | 22 | 6.1%
8 | 18 | 5.0%
Other values (2) | 15 | 4.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 302 | 83.2%
Dash Punctuation | 60 | 16.5%
Math Symbol | 1 | 0.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 66 | 21.9%
1 | 37 | 12.3%
2 | 35 | 11.6%
5 | 33 | 10.9%
4 | 28 | 9.3%
3 | 26 | 8.6%
7 | 23 | 7.6%
6 | 22 | 7.3%
8 | 18 | 6.0%
9 | 14 | 4.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 60 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 363 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 66 | 18.2%
- | 60 | 16.5%
1 | 37 | 10.2%
2 | 35 | 9.6%
5 | 33 | 9.1%
4 | 28 | 7.7%
3 | 26 | 7.2%
7 | 23 | 6.3%
6 | 22 | 6.1%
8 | 18 | 5.0%
Other values (2) | 15 | 4.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 363 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 66 | 18.2%
- | 60 | 16.5%
1 | 37 | 10.2%
2 | 35 | 9.6%
5 | 33 | 9.1%
4 | 28 | 7.7%
3 | 26 | 7.2%
7 | 23 | 6.3%
6 | 22 | 6.1%
8 | 18 | 5.0%
Other values (2) | 15 | 4.1%

주소명 (address)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 72
Median length: 40
Mean length: 39.064516
Min length: 19

Characters and Unicode

Total characters: 1211
Distinct characters: 156
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 대구 달서구 성서공단로15길 40 (호림동) (19-15번지)
2nd row: 부산 금정구 중앙대로 2139 (청룡동) (80-4)
3rd row: 서울 서초구 효령로 179 4층 (서초동) (1490-32번지 4층)
4th row: 서울 구로구 공원로 70 (구로동) (23-1번지)
5th row: 광주 광산구 무진대로 270 (우산동 2층)
Value | Count | Frequency (%)
서울특별시 | 6 | 2.7%
서울 | 5 | 2.2%
4층 | 5 | 2.2%
남동구 | 3 | 1.3%
금천구 | 3 | 1.3%
대구 | 3 | 1.3%
에이스하이테크시티 | 2 | 0.9%
1107호 | 2 | 0.9%
강남구 | 2 | 0.9%
논현두손지젤타워 | 2 | 0.9%
Other values (172) | 193 | 85.4%

Most occurring characters

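Value | Count | Frequency (%)
(space) | 213 | 17.6%
1 | 55 | 4.5%
) | 48 | 4.0%
( | 48 | 4.0%
[Hangul letter] | 43 | 3.6%
[Hangul letter] | 36 | 3.0%
0 | 31 | 2.6%
3 | 31 | 2.6%
[Hangul letter] | 31 | 2.6%
2 | 30 | 2.5%
Other values (146) | 645 | 53.3%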

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 614 | 50.7%
Decimal Number | 264 | 21.8%
Space Separator | 213 | 17.6%
Close Punctuation | 48 | 4.0%
Open Punctuation | 48 | 4.0%
Dash Punctuation | 17 | 1.4%
Uppercase Letter | 6 | 0.5%
Math Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[Hangul letter] | 43 | 7.0%
[Hangul letter] | 36 | 5.9%
[Hangul letter] | 31 | 5.0%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 21 | 3.4%
[Hangul letter] | 20 | 3.3%
[Hangul letter] | 19 | 3.1%
[Hangul letter] | 17 | 2.8%
[Hangul letter] | 14 | 2.3%
Other values (128) | 369 | 60.1%

Decimal Number
Value | Count | Frequency (%)
1 | 55 | 20.8%
0 | 31 | 11.7%
3 | 31 | 11.7%
2 | 30 | 11.4%
4 | 26 | 9.8%
7 | 22 | 8.3%
9 | 21 | 8.0%
5 | 18 | 6.8%
8 | 15 | 5.7%
6 | 15 | 5.7%

Uppercase Letter
Value | Count | Frequency (%)
D | 3 | 50.0%
T | 2 | 33.3%
W | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 213 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 48 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 48 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

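Value | Count | Frequency (%)
Hangul | 614 | 50.7%
Common | 591 | 48.8%
Latin | 6 | 0.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[Hangul letter] | 43 | 7.0%
[Hangul letter] | 36 | 5.9%
[Hangul letter] | 31 | 5.0%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 21 | 3.4%
[Hangul letter] | 20 | 3.3%
[Hangul letter] | 19 | 3.1%
[Hangul letter] | 17 | 2.8%
[Hangul letter] | 14 | 2.3%
Other values (128) | 369 | 60.1%

Common
Value | Count | Frequency (%)
(space) | 213 | 36.0%
1 | 55 | 9.3%
) | 48 | 8.1%
( | 48 | 8.1%
0 | 31 | 5.2%
3 | 31 | 5.2%
2 | 30 | 5.1%
4 | 26 | 4.4%
7 | 22 | 3.7%
9 | 21 | 3.6%
Other values (5) | 66 | 11.2%

Latin
Value | Count | Frequency (%)
D | 3 | 50.0%
T | 2 | 33.3%
W | 1 | 16.7%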

Most occurring blocks

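Value | Count | Frequency (%)
Hangul | 614 | 50.7%
ASCII | 597 | 49.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 213 | 35.7%
1 | 55 | 9.2%
) | 48 | 8.0%
( | 48 | 8.0%
0 | 31 | 5.2%
3 | 31 | 5.2%
2 | 30 | 5.0%
4 | 26 | 4.4%
7 | 22 | 3.7%
9 | 21 | 3.5%
Other values (8) | 72 | 12.1%

Hangul
Value | Count | Frequency (%)
[Hangul letter] | 43 | 7.0%
[Hangul letter] | 36 | 5.9%
[Hangul letter] | 31 | 5.0%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 22 | 3.6%
[Hangul letter] | 21 | 3.4%
[Hangul letter] | 20 | 3.3%
[Hangul letter] | 19 | 3.1%
[Hangul letter] | 17 | 2.8%
[Hangul letter] | 14 | 2.3%
Other values (128) | 369 | 60.1%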

지정관서 (designating office)
Categorical

Distinct: 6
Distinct (%): 19.4%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대 구 청
2nd row: 부 산 청
3rd row: 서 울 청
4th row: 서 울 청
5th row: 광 주 청

Common Values

Value | Count | Frequency (%)
서 울 청 | 11 | 35.5%
중 부 청 | 6 | 19.4%
부 산 청 | 5 | 16.1%
대 구 청 | 4 | 12.9%
광 주 청 | 3 | 9.7%
대 전 청 | 2 | 6.5%
6.5%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
청 | 31 | 33.3%
서 | 11 | 11.8%
울 | 11 | 11.8%
부 | 11 | 11.8%
중 | 6 | 6.5%
대 | 6 | 6.5%
산 | 5 | 5.4%
구 | 4 | 4.3%
광 | 3 | 3.2%
주 | 3 | 3.2%

Unnamed: 5
Unsupported

MISSING, REJECTED, UNSUPPORTED

Missing: 31
Missing (%): 100.0%
Memory size: 411.0 B

Interactions

[Figure: interactions plot]

Correlations

Variable | 연번 | 기관명 | 전화번호 | 주소명 | 지정관서
연번 | 1.000 | 1.000 | 1.000 | 1.000 | 0.321
기관명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주소명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
지정관서 | 0.321 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 지정관서
연번 | 1.000 | 0.000
지정관서 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 기관명 | 전화번호 | 주소명 | 지정관서 | Unnamed: 5
0 | 1 | (사)대한산업보건협회대구경북지역본부 | 053-592-4901 | 대구 달서구 성서공단로15길 40 (호림동) (19-15번지) | 대 구 청 | <NA>
1 | 2 | (사)대한산업보건협회부산경남지역본부 | 051-508-6088 | 부산 금정구 중앙대로 2139 (청룡동) (80-4) | 부 산 청 | <NA>
2 | 3 | 사단법인대한산업보건협회 | 02-2046-0511 | 서울 서초구 효령로 179 4층 (서초동) (1490-32번지 4층) | 서 울 청 | <NA>
3 | 4 | (사)대한산업안전협회 | 02-860-7000 | 서울 구로구 공원로 70 (구로동) (23-1번지) | 서 울 청 | <NA>
4 | 5 | (사)대한산업안전협회광주지역본부 | 062-943-0156 | 광주 광산구 무진대로 270 (우산동 2층) | 광 주 청 | <NA>
5 | 6 | (사)대한산업안전협회대구지역본부 | 053-710-3101 | 대구 수성구 화랑로 150 4층 (만촌동 동원빌딩) (1331-1번지 동원빌딩 4층) | 대 구 청 | <NA>
6 | 7 | (사)대한산업안전협회대전지역본부 | 042-628-2160 | 대전광역시 유성구 테크노2로 199-0 (용산동) 미건테크노월드1차 308호 (533번지 308호) | 대 전 청 | <NA>
7 | 8 | (사)대한산업안전협회부산지역본부 | 051-804-5454 | 부산 부산진구 전포대로199번길 15 현대타워오피스텔 8층 (전포동) (690-4번지 현대타워오피스텔 8층) | 부 산 청 | <NA>
8 | 9 | (사)대한산업안전협회충남서부지회 | 041-669-1485 | 충남 서산시 잠곡1로 19 (잠홍동) (575번지) | 대 전 청 | <NA>
9 | 10 | 대한산업안전협회울산지회 | 052-267-1500 | 울산광역시 북구 명촌21길 4-9 (661-6) | 부 산 청 | <NA>

(index) | 연번 | 기관명 | 전화번호 | 주소명 | 지정관서 | Unnamed: 5
21 | 22 | 사단법인한국산업안전심리상담협회 | 02-3280-3833 | 서울특별시 금천구 가산디지털1로 168 (씨동 1307호) | 서 울 청 | <NA>
22 | 23 | 사단법인한국안전기술협회 | 031-431-3122 | 경기 안산시 단원구 동산로 76 (원시동) 903 904 905호 | 중 부 청 | <NA>
23 | 24 | 사단법인한국안전보건협회 | 02-3666-9777 | 서울 영등포구 경인로 775 1동 1107호 (문래동3가 에이스하이테크시티) (55-20번지 에이스하이테크시티 1동 1107호) | 서 울 청 | <NA>
24 | 25 | 사단법인한국크레인협회 | 02-522-1507 | 서울특별시 금천구 벚꽃로 254 14층 1408호 (월드메르디앙1차) (912-9번지 세양빌딩2층) | 서 울 청 | <NA>
25 | 26 | 사단법인환경안전기술원 | 032-715-8134 | 인천 남동구 호구포로 201 203호 (논현동 논현두손지젤타워) (644-4번지 논현두손지젤타워 203호) | 중 부 청 | <NA>
26 | 27 | 사단법인환경안전보건협회 | 02-3471-7534 | 서울특별시 동작구 동작대로 9 (사당동 태광빌딩 401호) (1061-17번지 지산빌딩 301호) | 서 울 청 | <NA>
27 | 28 | 한국종합안전교육사회적협동조합 | 1588-2907 | 광주 서구 하남대로 494 (동천동) 4층 | 광 주 청 | <NA>
28 | 29 | 한국해양수산연수원 | 051-620-5411 | 부산광역시 영도구 해양로301번길 17(동삼동 부산항국제크루즈터미널)(1125번지 부산항국제크루즈터미널) | 부 산 청 | <NA>
29 | 30 | 사단법인한국화학안전협회 | 031-410-2992 | 경기도 안산시 단원구 고잔1길 76 리치프라자 402호 | 중 부 청 | <NA>
30 | 31 | 사단법인안전보건진흥원 | 02-805-4545 | 서울특별시 금천구 범안로 1152(독산동 독산빌딩3) | 서 울 청 | <NA>