Overview

Dataset statistics

Number of variables6
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory54.1 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Descriptionㅇ 남녀고용평등 실현에 앞장선 우수기업과 유공자를 발굴 시상하여 남녀가 동등하게 일할 수 있는 고용환경을 조성하도록 장려하고, 사회 전반에 남녀고용평등 의식 확산을 유도 ㅇ 적극적 고용개선조치(Affirmative Action) 이행실적 평가결과 우수기업을 시상하여 이행률 제고 ㅇ 남녀고용평등 포상 우수기업 명단 제공
Author공공데이터포털
URLhttps://www.data.go.kr/data/15066117/fileData.do

Alerts

연번 is highly overall correlated with 규모High correlation
규모 is highly overall correlated with 연번High correlation
사업장명 has unique valuesUnique

Reproduction

Analysis started2024-04-20 17:12:18.980189
Analysis finished2024-04-20 17:12:20.146164
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7
Minimum1
Maximum15
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size353.0 B
2024-04-21T02:12:20.300683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.2
Q14
median7
Q310
95-th percentile13.8
Maximum15
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.0824829
Coefficient of variation (CV)0.58321184
Kurtosis-0.8291502
Mean7
Median Absolute Deviation (MAD)3
Skewness0.29953
Sum175
Variance16.666667
MonotonicityNot monotonic
2024-04-21T02:12:20.651151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 2
 
8.0%
2 2
 
8.0%
3 2
 
8.0%
4 2
 
8.0%
5 2
 
8.0%
6 2
 
8.0%
7 2
 
8.0%
8 2
 
8.0%
9 2
 
8.0%
10 2
 
8.0%
Other values (5) 5
20.0%
ValueCountFrequency (%)
1 2
8.0%
2 2
8.0%
3 2
8.0%
4 2
8.0%
5 2
8.0%
6 2
8.0%
7 2
8.0%
8 2
8.0%
9 2
8.0%
10 2
8.0%
ValueCountFrequency (%)
15 1
4.0%
14 1
4.0%
13 1
4.0%
12 1
4.0%
11 1
4.0%
10 2
8.0%
9 2
8.0%
8 2
8.0%
7 2
8.0%
6 2
8.0%

분야
Categorical

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
남녀고용평등분야
15 
적극적고용개선조치분야
10 

Length

Max length11
Median length8
Mean length9.2
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남녀고용평등분야
2nd row남녀고용평등분야
3rd row남녀고용평등분야
4th row남녀고용평등분야
5th row남녀고용평등분야

Common Values

ValueCountFrequency (%)
남녀고용평등분야 15
60.0%
적극적고용개선조치분야 10
40.0%

Length

2024-04-21T02:12:21.062906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T02:12:21.387129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남녀고용평등분야 15
60.0%
적극적고용개선조치분야 10
40.0%
Distinct17
Distinct (%)68.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
2024-04-21T02:12:21.892223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.96
Min length2

Characters and Unicode

Total characters74
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)56.0%

Sample

1st row서울청
2nd row고양
3rd row서울강남
4th row서울강남
5th row서울청
ValueCountFrequency (%)
서울청 6
24.0%
서울동부 3
12.0%
서울강남 2
 
8.0%
구미 1
 
4.0%
대전청 1
 
4.0%
보령 1
 
4.0%
서울북부 1
 
4.0%
충주 1
 
4.0%
서울남부 1
 
4.0%
고양 1
 
4.0%
Other values (7) 7
28.0%
2024-04-21T02:12:22.884046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
20.3%
14
18.9%
9
12.2%
6
 
8.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
Other values (16) 16
21.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
20.3%
14
18.9%
9
12.2%
6
 
8.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
Other values (16) 16
21.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
20.3%
14
18.9%
9
12.2%
6
 
8.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
Other values (16) 16
21.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
20.3%
14
18.9%
9
12.2%
6
 
8.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
Other values (16) 16
21.6%

사업장명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
2024-04-21T02:12:23.456965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length9.44
Min length4

Characters and Unicode

Total characters236
Distinct characters122
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st rowSK 주식회사
2nd row주식회사 선영종합엔지니어링
3rd row랄프로렌코리아 유한회사
4th row㈜골프존
5th row㈜코리아세븐
ValueCountFrequency (%)
주식회사 5
 
14.3%
2
 
5.7%
sk 1
 
2.9%
국립생태원 1
 
2.9%
서울특별시도봉구시설관리공단 1
 
2.9%
한국과학기술기획평가원 1
 
2.9%
엘지이노텍 1
 
2.9%
서울올림픽기념국민체육진흥공단 1
 
2.9%
국제방송교류재단 1
 
2.9%
재단법인 1
 
2.9%
Other values (20) 20
57.1%
2024-04-21T02:12:24.254267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
4.7%
10
 
4.2%
10
 
4.2%
9
 
3.8%
7
 
3.0%
6
 
2.5%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (112) 162
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 213
90.3%
Space Separator 10
 
4.2%
Other Symbol 10
 
4.2%
Uppercase Letter 2
 
0.8%
Lowercase Letter 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
5.2%
9
 
4.2%
7
 
3.3%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (107) 149
70.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Symbol
ValueCountFrequency (%)
10
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 223
94.5%
Common 10
 
4.2%
Latin 3
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
4.9%
10
 
4.5%
9
 
4.0%
7
 
3.1%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (108) 154
69.1%
Latin
ValueCountFrequency (%)
S 1
33.3%
e 1
33.3%
K 1
33.3%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 213
90.3%
ASCII 13
 
5.5%
None 10
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
5.2%
9
 
4.2%
7
 
3.3%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (107) 149
70.0%
ASCII
ValueCountFrequency (%)
10
76.9%
S 1
 
7.7%
e 1
 
7.7%
K 1
 
7.7%
None
ValueCountFrequency (%)
10
100.0%

규모
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
대기업
공공기관
중소기업
공기업

Length

Max length4
Median length4
Mean length3.6
Min length3

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row대기업
2nd row중소기업
3rd row대기업
4th row중소기업
5th row대기업

Common Values

ValueCountFrequency (%)
대기업 9
36.0%
공공기관 9
36.0%
중소기업 6
24.0%
공기업 1
 
4.0%

Length

2024-04-21T02:12:24.490263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T02:12:24.683024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기업 9
36.0%
공공기관 9
36.0%
중소기업 6
24.0%
공기업 1
 
4.0%

훈격
Categorical

Distinct4
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
장관표창
17 
국무총리표창
장관표창
대통령표창

Length

Max length6
Median length4
Mean length4.44
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대통령표창
2nd row대통령표창
3rd row국무총리표창
4th row국무총리표창
5th row장관표창

Common Values

ValueCountFrequency (%)
장관표창 17
68.0%
국무총리표창 3
 
12.0%
장관표창 3
 
12.0%
대통령표창 2
 
8.0%

Length

2024-04-21T02:12:24.897751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T02:12:25.095005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장관표창 20
80.0%
국무총리표창 3
 
12.0%
대통령표창 2
 
8.0%

Interactions

2024-04-21T02:12:19.344676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T02:12:25.224457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야지방관서사업장명규모훈격
연번1.0000.0000.8051.0000.8140.259
분야0.0001.0000.3031.0000.7070.545
지방관서0.8050.3031.0001.0000.7770.754
사업장명1.0001.0001.0001.0001.0001.000
규모0.8140.7070.7771.0001.0000.000
훈격0.2590.5450.7541.0000.0001.000
2024-04-21T02:12:25.394485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
훈격분야규모
훈격1.0000.3490.000
분야0.3491.0000.477
규모0.0000.4771.000
2024-04-21T02:12:25.540854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야규모훈격
연번1.0000.0000.5350.000
분야0.0001.0000.4770.349
규모0.5350.4771.0000.000
훈격0.0000.3490.0001.000

Missing values

2024-04-21T02:12:19.673262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T02:12:20.017792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분야지방관서사업장명규모훈격
01남녀고용평등분야서울청SK 주식회사대기업대통령표창
12남녀고용평등분야고양주식회사 선영종합엔지니어링중소기업대통령표창
23남녀고용평등분야서울강남랄프로렌코리아 유한회사대기업국무총리표창
34남녀고용평등분야서울강남㈜골프존중소기업국무총리표창
45남녀고용평등분야서울청㈜코리아세븐대기업장관표창
56남녀고용평등분야서울동부롯데쇼핑㈜ e커머스사업본부대기업장관표창
67남녀고용평등분야서울관악한국후지필름㈜대기업장관표창
78남녀고용평등분야중부청삼성바이오로직스 주식회사대기업장관표창
89남녀고용평등분야서울청㈜대홍기획대기업장관표창
910남녀고용평등분야서울청주식회사 제뉴원사이언스중소기업장관표창
연번분야지방관서사업장명규모훈격
151적극적고용개선조치분야서울청한국방송광고진흥공사공공기관국무총리표창
162적극적고용개선조치분야평택주식회사 원익아이피에스중소기업장관표창
173적극적고용개선조치분야서울동부㈜미라콤아이앤씨대기업장관표창
184적극적고용개선조치분야서울청재단법인 국제방송교류재단공공기관장관표창
195적극적고용개선조치분야서울동부서울올림픽기념국민체육진흥공단공공기관장관표창
206적극적고용개선조치분야서울남부엘지이노텍 ㈜대기업장관표창
217적극적고용개선조치분야충주한국과학기술기획평가원공공기관장관표창
228적극적고용개선조치분야서울북부서울특별시도봉구시설관리공단공공기관장관표창
239적극적고용개선조치분야보령국립생태원공공기관장관표창
2410적극적고용개선조치분야천안한국기술교육대학교공공기관장관표창