Overview

Dataset statistics

Number of variables8
Number of observations1807
Missing cells9035
Missing cells (%)62.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory121.9 KiB
Average record size in memory69.1 B

Variable types

Text2
Boolean1
Unsupported5

Dataset

Description홈페이지 관련 테이블, 우편번호, 게시판 그룹, 회원그룹, 행정코드, 게시판 구조 등의 정보입니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15072252/fileData.do

Alerts

Unnamed: 3 has 1807 (100.0%) missing valuesMissing
Unnamed: 4 has 1807 (100.0%) missing valuesMissing
Unnamed: 5 has 1807 (100.0%) missing valuesMissing
Unnamed: 6 has 1807 (100.0%) missing valuesMissing
Unnamed: 7 has 1807 (100.0%) missing valuesMissing
첨부파일ID has unique valuesUnique
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 03:17:11.393823
Analysis finished2023-12-12 03:17:12.256032
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

첨부파일ID
Text

UNIQUE 

Distinct1807
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size14.2 KiB
2023-12-12T12:17:12.463018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length20
Min length20

Characters and Unicode

Total characters36140
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1807 ?
Unique (%)100.0%

Sample

1st rowFILE_000000000000571
2nd rowFILE_000000000000591
3rd rowFILE_000000000000592
4th rowFILE_000000000000593
5th rowFILE_000000000000594
ValueCountFrequency (%)
file_000000000000571 1
 
0.1%
file_000000000003737 1
 
0.1%
file_000000000003965 1
 
0.1%
file_000000000003964 1
 
0.1%
file_000000000003963 1
 
0.1%
file_000000000003962 1
 
0.1%
file_000000000003961 1
 
0.1%
file_000000000003950 1
 
0.1%
file_000000000003948 1
 
0.1%
file_000000000003947 1
 
0.1%
Other values (1797) 1797
99.4%
2023-12-12T12:17:12.998193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 20587
57.0%
F 1807
 
5.0%
I 1807
 
5.0%
L 1807
 
5.0%
E 1807
 
5.0%
_ 1807
 
5.0%
4 1027
 
2.8%
3 960
 
2.7%
1 920
 
2.5%
2 861
 
2.4%
Other values (5) 2750
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 27105
75.0%
Uppercase Letter 7228
 
20.0%
Connector Punctuation 1807
 
5.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 20587
76.0%
4 1027
 
3.8%
3 960
 
3.5%
1 920
 
3.4%
2 861
 
3.2%
5 726
 
2.7%
6 539
 
2.0%
8 506
 
1.9%
9 504
 
1.9%
7 475
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
F 1807
25.0%
I 1807
25.0%
L 1807
25.0%
E 1807
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1807
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 28912
80.0%
Latin 7228
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 20587
71.2%
_ 1807
 
6.2%
4 1027
 
3.6%
3 960
 
3.3%
1 920
 
3.2%
2 861
 
3.0%
5 726
 
2.5%
6 539
 
1.9%
8 506
 
1.8%
9 504
 
1.7%
Latin
ValueCountFrequency (%)
F 1807
25.0%
I 1807
25.0%
L 1807
25.0%
E 1807
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 20587
57.0%
F 1807
 
5.0%
I 1807
 
5.0%
L 1807
 
5.0%
E 1807
 
5.0%
_ 1807
 
5.0%
4 1027
 
2.8%
3 960
 
2.7%
1 920
 
2.5%
2 861
 
2.4%
Other values (5) 2750
 
7.6%
Distinct1772
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size14.2 KiB
2023-12-12T12:17:13.395922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length21.203652
Min length21

Characters and Unicode

Total characters38315
Distinct characters16
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1737 ?
Unique (%)96.1%

Sample

1st row2015-09-01 오후 4:27:50
2nd row2015-09-02 오후 2:29:39
3rd row2015-09-02 오후 2:39:39
4th row2015-09-02 오후 2:43:23
5th row2015-09-02 오후 2:52:09
ValueCountFrequency (%)
오후 1124
 
20.7%
오전 683
 
12.6%
2016-12-06 118
 
2.2%
2017-12-13 35
 
0.6%
2017-08-02 34
 
0.6%
2018-02-13 27
 
0.5%
2017-12-12 25
 
0.5%
2016-03-22 25
 
0.5%
2017-04-10 23
 
0.4%
2020-03-17 22
 
0.4%
Other values (2200) 3305
61.0%
2023-12-12T12:17:13.982502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5408
14.1%
1 4643
12.1%
2 4584
12.0%
- 3614
9.4%
3614
9.4%
: 3614
9.4%
1807
 
4.7%
3 1676
 
4.4%
4 1566
 
4.1%
5 1522
 
4.0%
Other values (6) 6267
16.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 23859
62.3%
Dash Punctuation 3614
 
9.4%
Space Separator 3614
 
9.4%
Other Punctuation 3614
 
9.4%
Other Letter 3614
 
9.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5408
22.7%
1 4643
19.5%
2 4584
19.2%
3 1676
 
7.0%
4 1566
 
6.6%
5 1522
 
6.4%
9 1455
 
6.1%
6 1101
 
4.6%
8 988
 
4.1%
7 916
 
3.8%
Other Letter
ValueCountFrequency (%)
1807
50.0%
1124
31.1%
683
 
18.9%
Dash Punctuation
ValueCountFrequency (%)
- 3614
100.0%
Space Separator
ValueCountFrequency (%)
3614
100.0%
Other Punctuation
ValueCountFrequency (%)
: 3614
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34701
90.6%
Hangul 3614
 
9.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5408
15.6%
1 4643
13.4%
2 4584
13.2%
- 3614
10.4%
3614
10.4%
: 3614
10.4%
3 1676
 
4.8%
4 1566
 
4.5%
5 1522
 
4.4%
9 1455
 
4.2%
Other values (3) 3005
8.7%
Hangul
ValueCountFrequency (%)
1807
50.0%
1124
31.1%
683
 
18.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34701
90.6%
Hangul 3614
 
9.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5408
15.6%
1 4643
13.4%
2 4584
13.2%
- 3614
10.4%
3614
10.4%
: 3614
10.4%
3 1676
 
4.8%
4 1566
 
4.5%
5 1522
 
4.4%
9 1455
 
4.2%
Other values (3) 3005
8.7%
Hangul
ValueCountFrequency (%)
1807
50.0%
1124
31.1%
683
 
18.9%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
True
1509 
False
298 
ValueCountFrequency (%)
True 1509
83.5%
False 298
 
16.5%
2023-12-12T12:17:14.162021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1807
Missing (%)100.0%
Memory size16.0 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1807
Missing (%)100.0%
Memory size16.0 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1807
Missing (%)100.0%
Memory size16.0 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1807
Missing (%)100.0%
Memory size16.0 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1807
Missing (%)100.0%
Memory size16.0 KiB

Missing values

2023-12-12T12:17:11.988250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:17:12.181633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

첨부파일ID생성일시사용여부Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
0FILE_0000000000005712015-09-01 오후 4:27:50Y<NA><NA><NA><NA><NA>
1FILE_0000000000005912015-09-02 오후 2:29:39N<NA><NA><NA><NA><NA>
2FILE_0000000000005922015-09-02 오후 2:39:39N<NA><NA><NA><NA><NA>
3FILE_0000000000005932015-09-02 오후 2:43:23Y<NA><NA><NA><NA><NA>
4FILE_0000000000005942015-09-02 오후 2:52:09Y<NA><NA><NA><NA><NA>
5FILE_0000000000005952015-09-02 오후 2:52:42N<NA><NA><NA><NA><NA>
6FILE_0000000000005962015-09-02 오후 2:52:52Y<NA><NA><NA><NA><NA>
7FILE_0000000000005972015-09-02 오후 2:53:00N<NA><NA><NA><NA><NA>
8FILE_0000000000005982015-09-02 오후 2:53:09Y<NA><NA><NA><NA><NA>
9FILE_0000000000005992015-09-02 오후 3:17:34Y<NA><NA><NA><NA><NA>
첨부파일ID생성일시사용여부Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
1797FILE_0000000000054862020-09-25 오후 7:31:03Y<NA><NA><NA><NA><NA>
1798FILE_0000000000054872020-09-25 오후 7:32:22Y<NA><NA><NA><NA><NA>
1799FILE_0000000000054882020-09-25 오후 7:33:21Y<NA><NA><NA><NA><NA>
1800FILE_0000000000054912020-09-28 오후 4:08:10Y<NA><NA><NA><NA><NA>
1801FILE_0000000000054972020-10-07 오전 9:06:05Y<NA><NA><NA><NA><NA>
1802FILE_0000000000055022020-10-07 오후 4:11:42Y<NA><NA><NA><NA><NA>
1803FILE_0000000000055052020-10-07 오후 4:19:01Y<NA><NA><NA><NA><NA>
1804FILE_0000000000055152020-10-12 오후 6:02:13Y<NA><NA><NA><NA><NA>
1805FILE_0000000000055162020-10-12 오후 6:03:25Y<NA><NA><NA><NA><NA>
1806FILE_0000000000055172020-10-12 오후 6:04:20Y<NA><NA><NA><NA><NA>