Overview

Dataset statistics

Number of variables5
Number of observations476
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.7 KiB
Average record size in memory42.3 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description한국가스안전공사, 한국소비자원, 한국고용정보원의 정기 간행물 및 발간물 리스트 및 해당 자료를 쉽게 다운받거나 바로 읽어 볼수 있도록 URL을 제공합니다.
URLhttps://www.data.go.kr/data/15120475/fileData.do

Alerts

번호 is highly overall correlated with 기관 and 1 other fieldsHigh correlation
기관 is highly overall correlated with 번호High correlation
등록일 is highly overall correlated with 번호High correlation
기관 is highly imbalanced (59.1%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:08:55.506780
Analysis finished2023-12-12 05:08:56.207517
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct476
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean238.5
Minimum1
Maximum476
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T14:08:56.312713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.75
Q1119.75
median238.5
Q3357.25
95-th percentile452.25
Maximum476
Range475
Interquartile range (IQR)237.5

Descriptive statistics

Standard deviation137.55363
Coefficient of variation (CV)0.57674476
Kurtosis-1.2
Mean238.5
Median Absolute Deviation (MAD)119
Skewness0
Sum113526
Variance18921
MonotonicityStrictly increasing
2023-12-12T14:08:56.526744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
315 1
 
0.2%
327 1
 
0.2%
326 1
 
0.2%
325 1
 
0.2%
324 1
 
0.2%
323 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
320 1
 
0.2%
Other values (466) 466
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
476 1
0.2%
475 1
0.2%
474 1
0.2%
473 1
0.2%
472 1
0.2%
471 1
0.2%
470 1
0.2%
469 1
0.2%
468 1
0.2%
467 1
0.2%

기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
한국고용정보원
417 
한국소비자원
42 
한국가스안전공사
 
17

Length

Max length8
Median length7
Mean length6.947479
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국고용정보원
2nd row한국고용정보원
3rd row한국고용정보원
4th row한국고용정보원
5th row한국고용정보원

Common Values

ValueCountFrequency (%)
한국고용정보원 417
87.6%
한국소비자원 42
 
8.8%
한국가스안전공사 17
 
3.6%

Length

2023-12-12T14:08:56.714941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:08:56.843772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국고용정보원 417
87.6%
한국소비자원 42
 
8.8%
한국가스안전공사 17
 
3.6%

제목
Text

Distinct418
Distinct (%)87.8%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-12T14:08:57.311745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length43
Mean length20.821429
Min length4

Characters and Unicode

Total characters9911
Distinct characters405
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique400 ?
Unique (%)84.0%

Sample

1st row중·고령층 좋은 일자리로 재취업 주요 요인 분석
2nd row고용동향브리프 2023년 4호
3rd row인력수급 전망 방법론 연구
4th row산업별 인력수급 영향요인 분석
5th row2022 KEIS-고용 DB 분석
ValueCountFrequency (%)
분석 63
 
2.8%
62
 
2.8%
2021년 34
 
1.5%
2022년 33
 
1.5%
소비자시대 32
 
1.4%
동향 30
 
1.3%
연구 30
 
1.3%
노동시장 26
 
1.2%
고용 25
 
1.1%
고용동향브리프 24
 
1.1%
Other values (866) 1879
84.0%
2023-12-12T14:08:58.011546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1762
 
17.8%
2 408
 
4.1%
0 237
 
2.4%
207
 
2.1%
187
 
1.9%
185
 
1.9%
177
 
1.8%
150
 
1.5%
150
 
1.5%
149
 
1.5%
Other values (395) 6299
63.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6815
68.8%
Space Separator 1762
 
17.8%
Decimal Number 891
 
9.0%
Uppercase Letter 103
 
1.0%
Open Punctuation 79
 
0.8%
Close Punctuation 79
 
0.8%
Other Punctuation 62
 
0.6%
Lowercase Letter 62
 
0.6%
Dash Punctuation 49
 
0.5%
Math Symbol 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
207
 
3.0%
187
 
2.7%
185
 
2.7%
177
 
2.6%
150
 
2.2%
150
 
2.2%
149
 
2.2%
118
 
1.7%
109
 
1.6%
104
 
1.5%
Other values (341) 5279
77.5%
Uppercase Letter
ValueCountFrequency (%)
I 20
19.4%
D 14
13.6%
B 12
11.7%
E 11
10.7%
K 7
 
6.8%
W 6
 
5.8%
N 5
 
4.9%
O 5
 
4.9%
S 4
 
3.9%
T 3
 
2.9%
Other values (11) 16
15.5%
Decimal Number
ValueCountFrequency (%)
2 408
45.8%
0 237
26.6%
1 102
 
11.4%
3 52
 
5.8%
9 31
 
3.5%
6 16
 
1.8%
7 13
 
1.5%
5 12
 
1.3%
4 11
 
1.2%
8 9
 
1.0%
Lowercase Letter
ValueCountFrequency (%)
o 23
37.1%
f 17
27.4%
n 13
21.0%
r 4
 
6.5%
j 2
 
3.2%
b 2
 
3.2%
t 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 28
45.2%
, 13
21.0%
: 10
 
16.1%
· 8
 
12.9%
' 2
 
3.2%
? 1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 57
72.2%
[ 21
 
26.6%
1
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 57
72.2%
] 21
 
26.6%
1
 
1.3%
Space Separator
ValueCountFrequency (%)
1762
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6815
68.8%
Common 2930
29.6%
Latin 166
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
207
 
3.0%
187
 
2.7%
185
 
2.7%
177
 
2.6%
150
 
2.2%
150
 
2.2%
149
 
2.2%
118
 
1.7%
109
 
1.6%
104
 
1.5%
Other values (341) 5279
77.5%
Latin
ValueCountFrequency (%)
o 23
13.9%
I 20
12.0%
f 17
10.2%
D 14
 
8.4%
n 13
 
7.8%
B 12
 
7.2%
E 11
 
6.6%
K 7
 
4.2%
W 6
 
3.6%
N 5
 
3.0%
Other values (19) 38
22.9%
Common
ValueCountFrequency (%)
1762
60.1%
2 408
 
13.9%
0 237
 
8.1%
1 102
 
3.5%
( 57
 
1.9%
) 57
 
1.9%
3 52
 
1.8%
- 49
 
1.7%
9 31
 
1.1%
. 28
 
1.0%
Other values (15) 147
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6815
68.8%
ASCII 3085
31.1%
None 10
 
0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1762
57.1%
2 408
 
13.2%
0 237
 
7.7%
1 102
 
3.3%
( 57
 
1.8%
) 57
 
1.8%
3 52
 
1.7%
- 49
 
1.6%
9 31
 
1.0%
. 28
 
0.9%
Other values (40) 302
 
9.8%
Hangul
ValueCountFrequency (%)
207
 
3.0%
187
 
2.7%
185
 
2.7%
177
 
2.6%
150
 
2.2%
150
 
2.2%
149
 
2.2%
118
 
1.7%
109
 
1.6%
104
 
1.5%
Other values (341) 5279
77.5%
None
ValueCountFrequency (%)
· 8
80.0%
1
 
10.0%
1
 
10.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

등록일
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2022
219 
2021
142 
2023
115 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2022 219
46.0%
2021 142
29.8%
2023 115
24.2%

Length

2023-12-12T14:08:58.229007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:08:58.367069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 219
46.0%
2021 142
29.8%
2023 115
24.2%
Distinct449
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-12T14:08:58.633746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length161
Median length136
Mean length122.81723
Min length38

Characters and Unicode

Total characters58461
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique435 ?
Unique (%)91.4%

Sample

1st rowhttps://www.keis.or.kr/user/extra/main/2107/publication/reportList/jsp/LayOutPage.do?categoryIdx=126&pubIdx=9934&reportIdx=6141
2nd rowhttps://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=126&pubIdx=9934&spage=1&onlyList=N
3rd rowhttps://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9933&spage=1&onlyList=N
4th rowhttps://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9932&spage=1&onlyList=N
5th rowhttps://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9921&spage=1&onlyList=N
ValueCountFrequency (%)
https://www.kgs.or.kr/kgs/age/board.do 15
 
3.2%
https://www.kca.go.kr/webzine/kca_2207 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2211 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2212 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2210 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2209 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2206 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2208 2
 
0.4%
https://www.kca.go.kr/webzine/kca_2204 2
 
0.4%
https://www.kgs.or.kr/kgs/abaf/view.do 2
 
0.4%
Other values (439) 443
93.1%
2023-12-12T14:08:59.089937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 4423
 
7.6%
t 3881
 
6.6%
a 3074
 
5.3%
r 2996
 
5.1%
e 2848
 
4.9%
i 2795
 
4.8%
s 2662
 
4.6%
o 2643
 
4.5%
p 2561
 
4.4%
u 1920
 
3.3%
Other values (44) 28658
49.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 38922
66.6%
Other Punctuation 8504
 
14.5%
Decimal Number 6087
 
10.4%
Uppercase Letter 3361
 
5.7%
Math Symbol 1513
 
2.6%
Space Separator 42
 
0.1%
Connector Punctuation 32
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 3881
 
10.0%
a 3074
 
7.9%
r 2996
 
7.7%
e 2848
 
7.3%
i 2795
 
7.2%
s 2662
 
6.8%
o 2643
 
6.8%
p 2561
 
6.6%
u 1920
 
4.9%
d 1508
 
3.9%
Other values (15) 12034
30.9%
Decimal Number
ValueCountFrequency (%)
2 1362
22.4%
1 1353
22.2%
0 724
11.9%
3 539
 
8.9%
7 497
 
8.2%
9 429
 
7.0%
5 370
 
6.1%
6 336
 
5.5%
8 293
 
4.8%
4 184
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
L 1056
31.4%
I 1029
30.6%
P 417
 
12.4%
O 417
 
12.4%
N 222
 
6.6%
E 70
 
2.1%
B 50
 
1.5%
C 50
 
1.5%
D 40
 
1.2%
K 10
 
0.3%
Other Punctuation
ValueCountFrequency (%)
/ 4423
52.0%
. 1872
22.0%
& 1086
 
12.8%
: 476
 
5.6%
? 427
 
5.0%
% 220
 
2.6%
Math Symbol
ValueCountFrequency (%)
= 1513
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 42283
72.3%
Common 16178
 
27.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 3881
 
9.2%
a 3074
 
7.3%
r 2996
 
7.1%
e 2848
 
6.7%
i 2795
 
6.6%
s 2662
 
6.3%
o 2643
 
6.3%
p 2561
 
6.1%
u 1920
 
4.5%
d 1508
 
3.6%
Other values (25) 15395
36.4%
Common
ValueCountFrequency (%)
/ 4423
27.3%
. 1872
11.6%
= 1513
 
9.4%
2 1362
 
8.4%
1 1353
 
8.4%
& 1086
 
6.7%
0 724
 
4.5%
3 539
 
3.3%
7 497
 
3.1%
: 476
 
2.9%
Other values (9) 2333
14.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58461
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 4423
 
7.6%
t 3881
 
6.6%
a 3074
 
5.3%
r 2996
 
5.1%
e 2848
 
4.9%
i 2795
 
4.8%
s 2662
 
4.6%
o 2643
 
4.5%
p 2561
 
4.4%
u 1920
 
3.3%
Other values (44) 28658
49.0%

Interactions

2023-12-12T14:08:55.899266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:08:59.197623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호기관등록일
번호1.0000.7730.910
기관0.7731.0000.072
등록일0.9100.0721.000
2023-12-12T14:08:59.299865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관등록일
기관1.0000.021
등록일0.0211.000
2023-12-12T14:08:59.405603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호기관등록일
번호1.0000.6470.867
기관0.6471.0000.021
등록일0.8670.0211.000

Missing values

2023-12-12T14:08:56.019632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:08:56.156122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호기관제목등록일경로주소(URL)
01한국고용정보원중·고령층 좋은 일자리로 재취업 주요 요인 분석2023https://www.keis.or.kr/user/extra/main/2107/publication/reportList/jsp/LayOutPage.do?categoryIdx=126&pubIdx=9934&reportIdx=6141
12한국고용정보원고용동향브리프 2023년 4호2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=126&pubIdx=9934&spage=1&onlyList=N
23한국고용정보원인력수급 전망 방법론 연구2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9933&spage=1&onlyList=N
34한국고용정보원산업별 인력수급 영향요인 분석2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9932&spage=1&onlyList=N
45한국고용정보원2022 KEIS-고용 DB 분석2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9921&spage=1&onlyList=N
56한국고용정보원공공취업지원서비스 이용자의 경로 의존성과 성과의 취약성2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9920&spage=1&onlyList=N
67한국고용정보원DW 연구 분석 자료 구축 및 분석2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9919&spage=1&onlyList=N
78한국고용정보원고용보험 DB를 활용한 청년 일자리와 정책 성과 분석2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9918&spage=1&onlyList=N
89한국고용정보원특수형태근로종사자 현황과 사회 보험 확대를 통한 보호 방안2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9917&spage=1&onlyList=N
910한국고용정보원행정DB를 이용한 노동력변동 분석2023https://www.keis.or.kr/user/extra/main/2102/publication/publicationList/jsp/LayOutPage.do?categoryIdx=131&pubIdx=9916&spage=1&onlyList=N
번호기관제목등록일경로주소(URL)
466467한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2022https://www.kgs.or.kr/kgs/age/board.do
467468한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2022https://www.kgs.or.kr/kgs/age/board.do
468469한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2022https://www.kgs.or.kr/kgs/age/board.do
469470한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2022https://www.kgs.or.kr/kgs/age/board.do
470471한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2022https://www.kgs.or.kr/kgs/age/board.do
471472한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2023https://www.kgs.or.kr/kgs/age/board.do
472473한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2023https://www.kgs.or.kr/kgs/age/board.do
473474한국가스안전공사한국가스안전공사 정기간행물(가스안전지)2023https://www.kgs.or.kr/kgs/age/board.do
474475한국가스안전공사한국가스안전공사 홍보책자(가스안전 길잡이)2021https://www.kgs.or.kr/kgs/abaf/view.do
475476한국가스안전공사한국가스안전공사 홍보책자(가스안전 길잡이)2021https://www.kgs.or.kr/kgs/abaf/view.do