Overview

Dataset statistics

Number of variables4
Number of observations5415
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory174.6 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text1
Categorical2

Dataset

Description근로자가 일과 가정을 양립할 수 있도록 가족친화제도를 모범적으로 수행하고 있는 기업 등에 대하여 여성가족부장관이 인증한 가족친화기업 명단
URLhttps://www.data.go.kr/data/3071994/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:48:00.848903
Analysis finished2023-12-12 12:48:01.537564
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct5415
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2708
Minimum1
Maximum5415
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size47.7 KiB
2023-12-12T21:48:01.647308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile271.7
Q11354.5
median2708
Q34061.5
95-th percentile5144.3
Maximum5415
Range5414
Interquartile range (IQR)2707

Descriptive statistics

Standard deviation1563.3202
Coefficient of variation (CV)0.57729697
Kurtosis-1.2
Mean2708
Median Absolute Deviation (MAD)1354
Skewness0
Sum14663820
Variance2443970
MonotonicityStrictly increasing
2023-12-12T21:48:01.812488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3608 1
 
< 0.1%
3616 1
 
< 0.1%
3615 1
 
< 0.1%
3614 1
 
< 0.1%
3613 1
 
< 0.1%
3612 1
 
< 0.1%
3611 1
 
< 0.1%
3610 1
 
< 0.1%
3609 1
 
< 0.1%
Other values (5405) 5405
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
5415 1
< 0.1%
5414 1
< 0.1%
5413 1
< 0.1%
5412 1
< 0.1%
5411 1
< 0.1%
5410 1
< 0.1%
5409 1
< 0.1%
5408 1
< 0.1%
5407 1
< 0.1%
5406 1
< 0.1%
Distinct5395
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size42.4 KiB
2023-12-12T21:48:02.010434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length25
Mean length9.2535549
Min length1

Characters and Unicode

Total characters50108
Distinct characters778
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5379 ?
Unique (%)99.3%

Sample

1st row(주)대웅제약
2nd row교보생명보험(주)
3rd row유한킴벌리(주)
4th row건강보험심사평가원
5th row국민건강보험공단
ValueCountFrequency (%)
주식회사 2020
 
24.9%
재단법인 145
 
1.8%
농업회사법인 65
 
0.8%
유한회사 49
 
0.6%
서울특별시 47
 
0.6%
경기도 31
 
0.4%
경상북도 22
 
0.3%
전라남도 21
 
0.3%
강원도 18
 
0.2%
부산광역시 18
 
0.2%
Other values (5462) 5681
70.0%
2023-12-12T21:48:02.346966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3987
 
8.0%
2702
 
5.4%
2635
 
5.3%
2262
 
4.5%
2119
 
4.2%
) 1913
 
3.8%
( 1913
 
3.8%
1306
 
2.6%
1133
 
2.3%
672
 
1.3%
Other values (768) 29466
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43008
85.8%
Space Separator 2702
 
5.4%
Close Punctuation 1913
 
3.8%
Open Punctuation 1913
 
3.8%
Uppercase Letter 266
 
0.5%
Decimal Number 185
 
0.4%
Lowercase Letter 80
 
0.2%
Other Punctuation 36
 
0.1%
Other Symbol 3
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3987
 
9.3%
2635
 
6.1%
2262
 
5.3%
2119
 
4.9%
1306
 
3.0%
1133
 
2.6%
672
 
1.6%
574
 
1.3%
567
 
1.3%
564
 
1.3%
Other values (708) 27189
63.2%
Uppercase Letter
ValueCountFrequency (%)
C 31
 
11.7%
S 26
 
9.8%
E 21
 
7.9%
L 18
 
6.8%
N 17
 
6.4%
K 16
 
6.0%
I 15
 
5.6%
T 14
 
5.3%
O 12
 
4.5%
G 12
 
4.5%
Other values (15) 84
31.6%
Lowercase Letter
ValueCountFrequency (%)
o 13
16.2%
e 13
16.2%
d 10
12.5%
r 8
10.0%
t 8
10.0%
s 5
 
6.2%
a 4
 
5.0%
n 3
 
3.8%
i 3
 
3.8%
c 3
 
3.8%
Other values (6) 10
12.5%
Decimal Number
ValueCountFrequency (%)
1 39
21.1%
2 27
14.6%
3 22
11.9%
6 19
10.3%
8 19
10.3%
0 18
9.7%
7 15
 
8.1%
5 13
 
7.0%
9 12
 
6.5%
4 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 22
61.1%
& 6
 
16.7%
, 6
 
16.7%
· 2
 
5.6%
Space Separator
ValueCountFrequency (%)
2702
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1913
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1913
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43011
85.8%
Common 6751
 
13.5%
Latin 346
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3987
 
9.3%
2635
 
6.1%
2262
 
5.3%
2119
 
4.9%
1306
 
3.0%
1133
 
2.6%
672
 
1.6%
574
 
1.3%
567
 
1.3%
564
 
1.3%
Other values (709) 27192
63.2%
Latin
ValueCountFrequency (%)
C 31
 
9.0%
S 26
 
7.5%
E 21
 
6.1%
L 18
 
5.2%
N 17
 
4.9%
K 16
 
4.6%
I 15
 
4.3%
T 14
 
4.0%
o 13
 
3.8%
e 13
 
3.8%
Other values (31) 162
46.8%
Common
ValueCountFrequency (%)
2702
40.0%
) 1913
28.3%
( 1913
28.3%
1 39
 
0.6%
2 27
 
0.4%
. 22
 
0.3%
3 22
 
0.3%
6 19
 
0.3%
8 19
 
0.3%
0 18
 
0.3%
Other values (8) 57
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43008
85.8%
ASCII 7095
 
14.2%
None 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3987
 
9.3%
2635
 
6.1%
2262
 
5.3%
2119
 
4.9%
1306
 
3.0%
1133
 
2.6%
672
 
1.6%
574
 
1.3%
567
 
1.3%
564
 
1.3%
Other values (708) 27189
63.2%
ASCII
ValueCountFrequency (%)
2702
38.1%
) 1913
27.0%
( 1913
27.0%
1 39
 
0.5%
C 31
 
0.4%
2 27
 
0.4%
S 26
 
0.4%
. 22
 
0.3%
3 22
 
0.3%
E 21
 
0.3%
Other values (48) 379
 
5.3%
None
ValueCountFrequency (%)
3
60.0%
· 2
40.0%

분류
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size42.4 KiB
중소기업
3706 
공공기관
1118 
대기업
591 

Length

Max length4
Median length4
Mean length3.8908587
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기업
2nd row대기업
3rd row대기업
4th row공공기관
5th row공공기관

Common Values

ValueCountFrequency (%)
중소기업 3706
68.4%
공공기관 1118
 
20.6%
대기업 591
 
10.9%

Length

2023-12-12T21:48:02.578479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:48:02.675089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중소기업 3706
68.4%
공공기관 1118
 
20.6%
대기업 591
 
10.9%

지역
Categorical

Distinct17
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size42.4 KiB
서울
1539 
경기
968 
부산
316 
대전
306 
충북
297 
Other values (12)
1989 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row서울
3rd row서울
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
서울 1539
28.4%
경기 968
17.9%
부산 316
 
5.8%
대전 306
 
5.7%
충북 297
 
5.5%
충남 248
 
4.6%
경북 245
 
4.5%
인천 215
 
4.0%
전남 209
 
3.9%
경남 200
 
3.7%
Other values (7) 872
16.1%

Length

2023-12-12T21:48:02.781394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 1539
28.4%
경기 968
17.9%
부산 316
 
5.8%
대전 306
 
5.7%
충북 297
 
5.5%
충남 248
 
4.6%
경북 245
 
4.5%
인천 215
 
4.0%
전남 209
 
3.9%
경남 200
 
3.7%
Other values (7) 872
16.1%

Interactions

2023-12-12T21:48:01.303619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:48:02.883422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류지역
연번1.0000.4750.168
분류0.4751.0000.372
지역0.1680.3721.000
2023-12-12T21:48:03.018679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역분류
지역1.0000.216
분류0.2161.000
2023-12-12T21:48:03.128799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류지역
연번1.0000.3250.066
분류0.3251.0000.216
지역0.0660.2161.000

Missing values

2023-12-12T21:48:01.413121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:48:01.500815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번기업(관)명분류지역
01(주)대웅제약대기업서울
12교보생명보험(주)대기업서울
23유한킴벌리(주)대기업서울
34건강보험심사평가원공공기관강원
45국민건강보험공단공공기관강원
56국민연금공단공공기관전북
67인천국제공항공사공공기관인천
78제주국제자유도시개발센터공공기관제주
89한국농수산식품유통공사공공기관전남
910(주)삼광대기업경북
연번기업(관)명분류지역
54055406제36보병사단공공기관강원
54065407제7297부대공공기관경기
54075408제9보병사단사령부공공기관경기
54085409질병관리청공공기관충북
54095410창원해양경찰서공공기관경남
54105411한국광해광업공단공공기관강원
54115412한국교육시설안전원공공기관서울
54125413한국여성인권진흥원공공기관서울
54135414한식진흥원공공기관서울
54145415해군작전사령부해군정보단공공기관경남