Overview

Dataset statistics

Number of variables: 7
Number of observations: 822
Missing cells: 98
Missing cells (%): 1.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 45.1 KiB
Average record size in memory: 56.2 B
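
The two derived figures in this block follow from the raw counts. A minimal sketch of the arithmetic, assuming that the missing-cell share is taken over all cells (variables × observations) and that 1 KiB = 1024 B; the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 7
n_observations = 822
missing_cells = 98
total_size_kib = 45.1

total_cells = n_variables * n_observations           # 5754 cells in total
missing_pct = 100 * missing_cells / total_cells      # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Under these assumptions the results round to the 1.7% and 56.2 B reported above.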

Variable types

Text: 3
Categorical: 3
DateTime: 1

Dataset

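Description: Data on the document information list of Korea Southern Power Co., Ltd. (한국남부발전(주)), providing fields such as document title (문서제목), organization name (기관명), person in charge (담당자명), production date (생산일자), and document number (문서번호).
URL: https://www.data.go.kr/data/15119109/fileData.do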

Alerts

기관명 has constant value "한국남부발전㈜" (Constant)
문서번호 has 98 (11.9%) missing values (Missing)
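
As a rough illustration of how alerts like these could be derived, the sketch below flags a column as Constant when every value is identical and reports the share of missing values. The `column_alerts` helper and its two rules are assumptions made for this example, not ydata-profiling's implementation. The rows are a five-record excerpt from the Sample section, so the percentage differs from the full-dataset figure (98 of 822, i.e. 11.9%).

```python
# First five records from the report's Sample section, reduced to two columns.
# None stands for a missing 문서번호 (<NA> in the report).
rows = [
    {"기관명": "한국남부발전㈜", "문서번호": None},
    {"기관명": "한국남부발전㈜", "문서번호": None},
    {"기관명": "한국남부발전㈜", "문서번호": "기획(윤리준법)-287"},
    {"기관명": "한국남부발전㈜", "문서번호": "안전(공정안전)-310"},
    {"기관명": "한국남부발전㈜", "문서번호": "조달협력(계약)-644"},
]


def column_alerts(rows):
    """Return {column: [alert, ...]} using two simple, assumed rules."""
    alerts = {}
    for col in rows[0]:
        values = [row[col] for row in rows]
        present = [v for v in values if v is not None]
        found = []
        # Constant: no value is missing and all values are the same.
        if len(present) == len(values) and len(set(present)) == 1:
            found.append("Constant")
        # Missing: at least one empty cell; report count and percentage.
        n_missing = len(values) - len(present)
        if n_missing:
            pct = 100 * n_missing / len(values)
            found.append(f"Missing {n_missing} ({pct:.1f}%)")
        alerts[col] = found
    return alerts


alerts = column_alerts(rows)
for col, found in alerts.items():
    print(col, found)
```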

Reproduction

Analysis started: 2023-12-12 17:19:15.000434
Analysis finished: 2023-12-12 17:19:15.948372
Duration: 0.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
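
The reported duration can be checked against the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed here:

```python
from datetime import datetime

# Assumed layout of the timestamps as printed in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 17:19:15.000434", FMT)
finished = datetime.strptime("2023-12-12 17:19:15.948372", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```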

Variables

문서제목 (document title)
Text

Distinct: 767
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Length

Max length: 44
Median length: 34
Mean length: 21.33455
Min length: 2

Characters and Unicode

Total characters: 17537
Distinct characters: 419
Distinct categories: 14
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
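
To make the idea of per-code-point properties concrete, the sketch below tallies the Unicode General Category of every character in the five sample titles listed for this variable. It uses Python's standard `unicodedata` module, which exposes categories but not scripts or blocks, so it only approximates the "categories" breakdown; how the report itself computes these properties is not stated here.

```python
import unicodedata
from collections import Counter

# Titles copied from this variable's Sample rows.
titles = [
    "육아휴직",
    "관내배치",
    "내부회계관리제도 운영세칙 제정(안)",
    "2021∼2025년 KOSPO 보건경영 추진계획",
    "안동보·수하보 소수력 기자재 설치조건부 구매계약 추진",
]

# unicodedata.category returns the two-letter General Category code,
# e.g. "Lo" (Other Letter), "Zs" (Space Separator), "Nd" (Decimal Number).
category_counts = Counter(
    unicodedata.category(ch) for title in titles for ch in title
)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the tables for this variable.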

Unique

Unique: 737
Unique (%): 89.7%
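
One reading consistent with the figures for this variable (767 distinct, 737 unique) is that Distinct counts the different values present, while Unique counts only the values that occur exactly once. A minimal sketch under that assumption; the helper name and the example list are hypothetical:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical titles, only to show how the two counts diverge.
example = ["육아휴직", "관내배치", "육아휴직", "파견연장"]
print(distinct_and_unique(example))  # (3, 2)
```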

Sample

1st row: 육아휴직
2nd row: 관내배치
3rd row: 내부회계관리제도 운영세칙 제정(안)
4th row: 2021∼2025년 KOSPO 보건경영 추진계획
5th row: 안동보·수하보 소수력 기자재 설치조건부 구매계약 추진
Value | Count | Frequency (%)
보고 | 126 | 3.3%
(not preserved) | 123 | 3.2%
2022년 | 90 | 2.4%
개최 | 84 | 2.2%
시행 | 81 | 2.1%
개정 | 69 | 1.8%
2021년 | 68 | 1.8%
이사회 | 65 | 1.7%
결과 | 65 | 1.7%
부의 | 51 | 1.3%
Other values (1157) | 2964 | 78.3%

Most occurring characters

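Value | Count | Frequency (%)
(space) | 2964 | 16.9%
2 | 882 | 5.0%
(not preserved) | 373 | 2.1%
) | 347 | 2.0%
( | 347 | 2.0%
(not preserved) | 332 | 1.9%
(not preserved) | 327 | 1.9%
(not preserved) | 324 | 1.8%
0 | 318 | 1.8%
(not preserved) | 309 | 1.8%
Other values (409) | 11014 | 62.8%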

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11624 | 66.3%
Space Separator | 2964 | 16.9%
Decimal Number | 1601 | 9.1%
Uppercase Letter | 391 | 2.2%
Close Punctuation | 363 | 2.1%
Open Punctuation | 363 | 2.1%
Other Punctuation | 119 | 0.7%
Lowercase Letter | 55 | 0.3%
Final Punctuation | 32 | 0.2%
Math Symbol | 12 | 0.1%
Other values (4) | 13 | 0.1%

Most frequent character per category

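Other Letter
Value | Count | Frequency (%)
(not preserved) | 373 | 3.2%
(not preserved) | 332 | 2.9%
(not preserved) | 327 | 2.8%
(not preserved) | 324 | 2.8%
(not preserved) | 309 | 2.7%
(not preserved) | 305 | 2.6%
(not preserved) | 301 | 2.6%
(not preserved) | 264 | 2.3%
(not preserved) | 212 | 1.8%
(not preserved) | 209 | 1.8%
Other values (337) | 8668 | 74.6%

Uppercase Letter
Value | Count | Frequency (%)
O | 95 | 24.3%
S | 59 | 15.1%
P | 54 | 13.8%
K | 45 | 11.5%
T | 30 | 7.7%
F | 17 | 4.3%
C | 15 | 3.8%
I | 12 | 3.1%
E | 11 | 2.8%
M | 10 | 2.6%
Other values (12) | 43 | 11.0%

Lowercase Letter
Value | Count | Frequency (%)
l | 7 | 12.7%
o | 6 | 10.9%
r | 6 | 10.9%
u | 5 | 9.1%
a | 5 | 9.1%
e | 4 | 7.3%
s | 4 | 7.3%
t | 4 | 7.3%
m | 3 | 5.5%
k | 3 | 5.5%
Other values (5) | 8 | 14.5%

Decimal Number
Value | Count | Frequency (%)
2 | 882 | 55.1%
0 | 318 | 19.9%
1 | 188 | 11.7%
3 | 111 | 6.9%
5 | 29 | 1.8%
4 | 28 | 1.7%
6 | 15 | 0.9%
9 | 11 | 0.7%
7 | 11 | 0.7%
8 | 8 | 0.5%

Other Punctuation
Value | Count | Frequency (%)
, | 63 | 52.9%
· | 31 | 26.1%
/ | 12 | 10.1%
& | 6 | 5.0%
; | 4 | 3.4%
(not preserved) | 1 | 0.8%
. | 1 | 0.8%
# | 1 | 0.8%

Close Punctuation
Value | Count | Frequency (%)
) | 347 | 95.6%
(not preserved) | 10 | 2.8%
] | 3 | 0.8%
(not preserved) | 3 | 0.8%

Open Punctuation
Value | Count | Frequency (%)
( | 347 | 95.6%
(not preserved) | 10 | 2.8%
[ | 3 | 0.8%
(not preserved) | 3 | 0.8%

Math Symbol
Value | Count | Frequency (%)
~ | 10 | 83.3%
(not preserved) | 2 | 16.7%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 1 | 50.0%
(not preserved) | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2964 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
(not preserved) | 32 | 100.0%

Initial Punctuation
Value | Count | Frequency (%)
(not preserved) | 4 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%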

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 11604 | 66.2%
Common | 5466 | 31.2%
Latin | 446 | 2.5%
Han | 21 | 0.1%

Most frequent character per script

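Hangul
Value | Count | Frequency (%)
(not preserved) | 373 | 3.2%
(not preserved) | 332 | 2.9%
(not preserved) | 327 | 2.8%
(not preserved) | 324 | 2.8%
(not preserved) | 309 | 2.7%
(not preserved) | 305 | 2.6%
(not preserved) | 301 | 2.6%
(not preserved) | 264 | 2.3%
(not preserved) | 212 | 1.8%
(not preserved) | 209 | 1.8%
Other values (336) | 8648 | 74.5%

Latin
Value | Count | Frequency (%)
O | 95 | 21.3%
S | 59 | 13.2%
P | 54 | 12.1%
K | 45 | 10.1%
T | 30 | 6.7%
F | 17 | 3.8%
C | 15 | 3.4%
I | 12 | 2.7%
E | 11 | 2.5%
M | 10 | 2.2%
Other values (27) | 98 | 22.0%

Common
Value | Count | Frequency (%)
(space) | 2964 | 54.2%
2 | 882 | 16.1%
) | 347 | 6.3%
( | 347 | 6.3%
0 | 318 | 5.8%
1 | 188 | 3.4%
3 | 111 | 2.0%
, | 63 | 1.2%
(not preserved) | 32 | 0.6%
· | 31 | 0.6%
Other values (24) | 183 | 3.3%

Han
Value | Count | Frequency (%)
(not preserved) | 20 | 95.2%
(not preserved) | 1 | 4.8%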

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 11603 | 66.2%
ASCII | 5815 | 33.2%
None | 58 | 0.3%
Punctuation | 37 | 0.2%
CJK | 21 | 0.1%
Math Operators | 2 | < 0.1%
CJK Compat | 1 | < 0.1%

Most frequent character per block

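ASCII
Value | Count | Frequency (%)
(space) | 2964 | 51.0%
2 | 882 | 15.2%
) | 347 | 6.0%
( | 347 | 6.0%
0 | 318 | 5.5%
1 | 188 | 3.2%
3 | 111 | 1.9%
O | 95 | 1.6%
, | 63 | 1.1%
S | 59 | 1.0%
Other values (51) | 441 | 7.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 373 | 3.2%
(not preserved) | 332 | 2.9%
(not preserved) | 327 | 2.8%
(not preserved) | 324 | 2.8%
(not preserved) | 309 | 2.7%
(not preserved) | 305 | 2.6%
(not preserved) | 301 | 2.6%
(not preserved) | 264 | 2.3%
(not preserved) | 212 | 1.8%
(not preserved) | 209 | 1.8%
Other values (335) | 8647 | 74.5%

Punctuation
Value | Count | Frequency (%)
(not preserved) | 32 | 86.5%
(not preserved) | 4 | 10.8%
(not preserved) | 1 | 2.7%

None
Value | Count | Frequency (%)
· | 31 | 53.4%
(not preserved) | 10 | 17.2%
(not preserved) | 10 | 17.2%
(not preserved) | 3 | 5.2%
(not preserved) | 3 | 5.2%
(not preserved) | 1 | 1.7%

CJK
Value | Count | Frequency (%)
(not preserved) | 20 | 95.2%
(not preserved) | 1 | 4.8%

Math Operators
Value | Count | Frequency (%)
(not preserved) | 2 | 100.0%

CJK Compat
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%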

기관명 (organization name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB
한국남부발전㈜: 822

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한국남부발전㈜
2nd row: 한국남부발전㈜
3rd row: 한국남부발전㈜
4th row: 한국남부발전㈜
5th row: 한국남부발전㈜

Common Values

Value | Count | Frequency (%)
한국남부발전㈜ | 822 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

담당자명 (person in charge)
Text

Distinct: 202
Distinct (%): 24.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9963504
Min length: 2

Characters and Unicode

Total characters: 2463
Distinct characters: 132
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 80
Unique (%): 9.7%

Sample

1st row: 배재민
2nd row: 배재민
3rd row: 손형인
4th row: 박상균
5th row: 조미선

Value | Count | Frequency (%)
홍승화 | 41 | 5.0%
이동휘 | 41 | 5.0%
최한울 | 33 | 4.0%
조남형 | 31 | 3.8%
정옥수 | 23 | 2.8%
손진호 | 22 | 2.7%
김범규 | 18 | 2.2%
서병석 | 16 | 1.9%
김기현 | 13 | 1.6%
강지혜 | 13 | 1.6%
Other values (192) | 571 | 69.5%

Most occurring characters

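Value | Count | Frequency (%)
(not preserved) | 164 | 6.7%
(not preserved) | 111 | 4.5%
(not preserved) | 93 | 3.8%
(not preserved) | 66 | 2.7%
(not preserved) | 64 | 2.6%
(not preserved) | 64 | 2.6%
(not preserved) | 59 | 2.4%
(not preserved) | 57 | 2.3%
(not preserved) | 56 | 2.3%
(not preserved) | 53 | 2.2%
Other values (122) | 1676 | 68.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2463 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 164 | 6.7%
(not preserved) | 111 | 4.5%
(not preserved) | 93 | 3.8%
(not preserved) | 66 | 2.7%
(not preserved) | 64 | 2.6%
(not preserved) | 64 | 2.6%
(not preserved) | 59 | 2.4%
(not preserved) | 57 | 2.3%
(not preserved) | 56 | 2.3%
(not preserved) | 53 | 2.2%
Other values (122) | 1676 | 68.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2463 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 164 | 6.7%
(not preserved) | 111 | 4.5%
(not preserved) | 93 | 3.8%
(not preserved) | 66 | 2.7%
(not preserved) | 64 | 2.6%
(not preserved) | 64 | 2.6%
(not preserved) | 59 | 2.4%
(not preserved) | 57 | 2.3%
(not preserved) | 56 | 2.3%
(not preserved) | 53 | 2.2%
Other values (122) | 1676 | 68.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2463 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 164 | 6.7%
(not preserved) | 111 | 4.5%
(not preserved) | 93 | 3.8%
(not preserved) | 66 | 2.7%
(not preserved) | 64 | 2.6%
(not preserved) | 64 | 2.6%
(not preserved) | 59 | 2.4%
(not preserved) | 57 | 2.3%
(not preserved) | 56 | 2.3%
(not preserved) | 53 | 2.2%
Other values (122) | 1676 | 68.0%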

생산일자 (production date)
DateTime

Distinct: 404
Distinct (%): 49.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB
Minimum: 2021-01-04 00:00:00
Maximum: 2023-07-31 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]
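
For orientation, 50 fixed-size bins between the reported minimum and maximum dates are each a little under 19 days wide. The sketch below shows one way to assign a date to such a bin, assuming equal-width bins with the right edge folded into the last bin (the convention NumPy's histogram uses); `fixed_bin_index` is a hypothetical helper, not part of the profiling tool.

```python
from datetime import datetime


def fixed_bin_index(value, minimum, maximum, bins=50):
    """Map a timestamp to one of `bins` equal-width bins over [minimum, maximum]."""
    span = (maximum - minimum).total_seconds()
    offset = (value - minimum).total_seconds()
    index = int(offset / span * bins)
    # The maximum itself sits on the right edge; fold it into the last bin.
    return min(index, bins - 1)


# Minimum and maximum as reported for this variable.
minimum = datetime(2021, 1, 4)
maximum = datetime(2023, 7, 31)

bin_width_days = (maximum - minimum).days / 50
print(f"bin width: {bin_width_days:.2f} days")
print(fixed_bin_index(datetime(2021, 1, 14), minimum, maximum))
```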

문서번호 (document number)
Text

MISSING

Distinct: 724
Distinct (%): 100.0%
Missing: 98
Missing (%): 11.9%
Memory size: 6.6 KiB

Length

Max length: 17
Median length: 16
Mean length: 14.904696
Min length: 6

Characters and Unicode

Total characters: 10791
Distinct characters: 139
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 724
Unique (%): 100.0%

Sample

1st row: 기획(윤리준법)-287
2nd row: 안전(공정안전)-310
3rd row: 조달협력(계약)-644
4th row: 건설(계전기술)-655
5th row: 안전(산업안전)-988

Value | Count | Frequency (%)
관리(인재경영)-46740 | 1 | 0.1%
수소융합(수소기술)-65959 | 1 | 0.1%
관리(인재경영)-72754 | 1 | 0.1%
감사(청렴감사)-73667 | 1 | 0.1%
관리(인재경영)-74381 | 1 | 0.1%
감사(청렴감사)-74610 | 1 | 0.1%
관리(세무회계)-74623 | 1 | 0.1%
관리(세무회계)-74625 | 1 | 0.1%
안전경영(재난관리)-74645 | 1 | 0.1%
관리(인재경영)-75466 | 1 | 0.1%
Other values (714) | 714 | 98.6%
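
The sample values suggest a regular layout: a leading name, a parenthesised sub-unit, a hyphen, and a running number. The sketch below splits a value along those lines. The pattern is inferred from the samples only, the field names `department`, `team`, and `serial` are labels invented for this example, and the parenthesised part is treated as optional because the format of the remaining values is not shown.

```python
import re

# Pattern inferred from sample values such as "기획(윤리준법)-287":
# a leading name, an optional parenthesised part, a hyphen, then digits.
DOC_NUMBER = re.compile(
    r"^(?P<department>[^()\-]+)(?:\((?P<team>[^()]+)\))?-(?P<serial>\d+)$"
)


def parse_doc_number(value):
    """Split a 문서번호 string into parts, or return None if it does not fit."""
    if value is None:  # missing document number (<NA> in the report)
        return None
    match = DOC_NUMBER.match(value)
    if match is None:
        return None
    parts = match.groupdict()
    parts["serial"] = int(parts["serial"])
    return parts


print(parse_doc_number("기획(윤리준법)-287"))
print(parse_doc_number("관리(인재경영)-46740"))
```

Values that do not fit the assumed layout return None rather than raising, so they can be inspected separately.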

Most occurring characters

Value | Count | Frequency (%)
- | 724 | 6.7%
( | 708 | 6.6%
) | 708 | 6.6%
4 | 411 | 3.8%
6 | 375 | 3.5%
1 | 372 | 3.4%
5 | 365 | 3.4%
7 | 356 | 3.3%
3 | 349 | 3.2%
2 | 345 | 3.2%
Other values (129) | 6078 | 56.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4916 | 45.6%
Decimal Number | 3542 | 32.8%
Dash Punctuation | 724 | 6.7%
Open Punctuation | 708 | 6.6%
Close Punctuation | 708 | 6.6%
Uppercase Letter | 192 | 1.8%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

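Other Letter
Value | Count | Frequency (%)
(not preserved) | 325 | 6.6%
(not preserved) | 301 | 6.1%
(not preserved) | 275 | 5.6%
(not preserved) | 266 | 5.4%
(not preserved) | 238 | 4.8%
(not preserved) | 226 | 4.6%
(not preserved) | 224 | 4.6%
(not preserved) | 183 | 3.7%
(not preserved) | 174 | 3.5%
(not preserved) | 148 | 3.0%
Other values (110) | 2556 | 52.0%

Decimal Number
Value | Count | Frequency (%)
4 | 411 | 11.6%
6 | 375 | 10.6%
1 | 372 | 10.5%
5 | 365 | 10.3%
7 | 356 | 10.1%
3 | 349 | 9.9%
2 | 345 | 9.7%
9 | 340 | 9.6%
8 | 333 | 9.4%
0 | 296 | 8.4%

Uppercase Letter
Value | Count | Frequency (%)
G | 64 | 33.3%
E | 61 | 31.8%
S | 61 | 31.8%
N | 3 | 1.6%
L | 3 | 1.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 724 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 708 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 708 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%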

Most occurring scripts

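Value | Count | Frequency (%)
Common | 5683 | 52.7%
Hangul | 4916 | 45.6%
Latin | 192 | 1.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 325 | 6.6%
(not preserved) | 301 | 6.1%
(not preserved) | 275 | 5.6%
(not preserved) | 266 | 5.4%
(not preserved) | 238 | 4.8%
(not preserved) | 226 | 4.6%
(not preserved) | 224 | 4.6%
(not preserved) | 183 | 3.7%
(not preserved) | 174 | 3.5%
(not preserved) | 148 | 3.0%
Other values (110) | 2556 | 52.0%

Common
Value | Count | Frequency (%)
- | 724 | 12.7%
( | 708 | 12.5%
) | 708 | 12.5%
4 | 411 | 7.2%
6 | 375 | 6.6%
1 | 372 | 6.5%
5 | 365 | 6.4%
7 | 356 | 6.3%
3 | 349 | 6.1%
2 | 345 | 6.1%
Other values (4) | 970 | 17.1%

Latin
Value | Count | Frequency (%)
G | 64 | 33.3%
E | 61 | 31.8%
S | 61 | 31.8%
N | 3 | 1.6%
L | 3 | 1.6%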

Most occurring blocks

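Value | Count | Frequency (%)
ASCII | 5875 | 54.4%
Hangul | 4916 | 45.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 724 | 12.3%
( | 708 | 12.1%
) | 708 | 12.1%
4 | 411 | 7.0%
6 | 375 | 6.4%
1 | 372 | 6.3%
5 | 365 | 6.2%
7 | 356 | 6.1%
3 | 349 | 5.9%
2 | 345 | 5.9%
Other values (9) | 1162 | 19.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 325 | 6.6%
(not preserved) | 301 | 6.1%
(not preserved) | 275 | 5.6%
(not preserved) | 266 | 5.4%
(not preserved) | 238 | 4.8%
(not preserved) | 226 | 4.6%
(not preserved) | 224 | 4.6%
(not preserved) | 183 | 3.7%
(not preserved) | 174 | 3.5%
(not preserved) | 148 | 3.0%
Other values (110) | 2556 | 52.0%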

보존기간 (retention period)
Categorical

Distinct: 6
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB
5년: 229
10년: 210
영구: 151
30년: 107
준영구: 88

Length

Max length: 3
Median length: 2
Mean length: 2.4927007
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영구
2nd row: 영구
3rd row: 10년
4th row: 10년
5th row: 30년

Common Values

Value | Count | Frequency (%)
5년 | 229 | 27.9%
10년 | 210 | 25.5%
영구 | 151 | 18.4%
30년 | 107 | 13.0%
준영구 | 88 | 10.7%
3년 | 37 | 4.5%
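
The percentages and the mean length reported for this variable can be recomputed from the counts alone. A minimal sketch, assuming string length is measured in Unicode code points (as Python's `len` does for these precomposed Hangul strings):

```python
# Counts copied from the Common Values table for 보존기간.
counts = {"5년": 229, "10년": 210, "영구": 151, "30년": 107, "준영구": 88, "3년": 37}

total = sum(counts.values())  # number of observations
shares = {value: 100 * n / total for value, n in counts.items()}
# Mean length: each value's length weighted by how often it occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

for value, share in shares.items():
    print(f"{value}: {share:.1f}%")
print(f"Mean length: {mean_length:.7f}")
```

Under that assumption the weighted mean comes out at 2.4927007, matching the Length block for this variable.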

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

공개여부 (disclosure status)
Categorical

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB
공개: 570
부분공개: 252

Length

Max length: 4
Median length: 2
Mean length: 2.6131387
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공개
2nd row: 공개
3rd row: 공개
4th row: 공개
5th row: 부분공개

Common Values

Value | Count | Frequency (%)
공개 | 570 | 69.3%
부분공개 | 252 | 30.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Correlations

[Figure: correlation plot]
Variable | 보존기간 | 공개여부
보존기간 | 1.000 | 0.416
공개여부 | 0.416 | 1.000

[Figure: correlation plot]
Variable | 보존기간 | 공개여부
보존기간 | 1.000 | 0.299
공개여부 | 0.299 | 1.000

[Figure: correlation plot]
Variable | 보존기간 | 공개여부
보존기간 | 1.000 | 0.299
공개여부 | 0.299 | 1.000
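
The report does not say which association measure produced each matrix. For two categorical variables such as 보존기간 and 공개여부, Cramér's V is one common choice, so the sketch below shows the uncorrected form purely as an illustration. It is not claimed to reproduce the 0.416 or 0.299 values, and the 2×2 counts are hypothetical rather than taken from the dataset.

```python
from math import sqrt


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k))


# Hypothetical 2x2 counts, not taken from the dataset.
print(cramers_v([[30, 10], [10, 30]]))
```

The measure ranges from 0 (no association) to 1 (one variable fully determines the other); rows or columns that sum to zero would need to be dropped before calling this helper.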

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

Row | 문서제목 | 기관명 | 담당자명 | 생산일자 | 문서번호 | 보존기간 | 공개여부
0 | 육아휴직 | 한국남부발전㈜ | 배재민 | 2021-01-04 | <NA> | 영구 | 공개
1 | 관내배치 | 한국남부발전㈜ | 배재민 | 2021-01-04 | <NA> | 영구 | 공개
2 | 내부회계관리제도 운영세칙 제정(안) | 한국남부발전㈜ | 손형인 | 2021-01-04 | 기획(윤리준법)-287 | 10년 | 공개
3 | 2021∼2025년 KOSPO 보건경영 추진계획 | 한국남부발전㈜ | 박상균 | 2021-01-04 | 안전(공정안전)-310 | 10년 | 공개
4 | 안동보·수하보 소수력 기자재 설치조건부 구매계약 추진 | 한국남부발전㈜ | 조미선 | 2021-01-05 | 조달협력(계약)-644 | 30년 | 부분공개
5 | 건설공사 발주계획 보고 | 한국남부발전㈜ | 김성엽 | 2021-01-06 | 건설(계전기술)-655 | 10년 | 공개
6 | 「KOSPO 노사안전보건협의회」운영 개선(안) | 한국남부발전㈜ | 이상현 | 2021-01-07 | 안전(산업안전)-988 | 5년 | 공개
7 | 「사업선정위원회」명칭 변경 | 한국남부발전㈜ | 박경순 | 2021-01-08 | 해외사업(개발3)-1390 | 5년 | 공개
8 | 대정해상풍력 발전사업 출자회사 파견인력 충원 요청 | 한국남부발전㈜ | 이상규 | 2021-01-14 | 국내사업(풍력)-2811 | 30년 | 공개
9 | 삼척화력 변동비 절감 T/F 운영(案) | 한국남부발전㈜ | 안영헌 | 2021-01-14 | 발전(화력운영)-2938 | 10년 | 공개

Last rows

Row | 문서제목 | 기관명 | 담당자명 | 생산일자 | 문서번호 | 보존기간 | 공개여부
812 | 파견해제 및 이동 | 한국남부발전㈜ | 조남형 | 2023-07-12 | <NA> | 영구 | 공개
813 | 공무 국외출장 시행(베트남) | 한국남부발전㈜ | 이주형 | 2023-07-14 | 해외사업(개발1부)-50441 | 10년 | 부분공개
814 | 「발전소 운전정비규정」개정 시행 | 한국남부발전㈜ | 정진석 | 2023-07-14 | 발전(발전기획)-50442 | 5년 | 공개
815 | 통합투자사업관리규정 제정(안) 보고 | 한국남부발전㈜ | 정호일 | 2023-07-14 | ESG기획(사업금융)-50482 | 30년 | 공개
816 | 2023년도 제1차 KOSPO 안전경영위원회 결과 보고 | 한국남부발전㈜ | 이명호 | 2023-07-14 | 안전경영(안전총괄)-50492 | 5년 | 공개
817 | No Paper 추진 종합계획(안) | 한국남부발전㈜ | 김종신 | 2023-07-22 | 디지털정보(기획)-52663 | 5년 | 공개
818 | 법무규정 개정 추진 | 한국남부발전㈜ | 조현지 | 2023-07-24 | ESG기획(윤리준법)-53032 | 영구 | 공개
819 | 정비외자(계측제어분야) 장기 통합단가계약 추진(Emerson) | 한국남부발전㈜ | 조현범 | 2023-07-26 | 조달협력(계약자재)-53586 | 10년 | 부분공개
820 | 공무 국외출장 결과 보고(베트남) | 한국남부발전㈜ | 백영민 | 2023-07-28 | 해외사업(개발1부)-54693 | 10년 | 공개
821 | 파견연장 | 한국남부발전㈜ | 조남형 | 2023-07-31 | <NA> | 영구 | 공개