Overview

Dataset statistics

Number of variables2
Number of observations44
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory836.0 B
Average record size in memory19.0 B

Variable types

Text1
Categorical1

Dataset

Description울산광역시 승용차요일제 RFID 시스템 게시물에 관한 데이터입니다. 게시물 파일명과 확장자의 내용을 포함합니다.(개인정보 미포함)
URLhttps://www.data.go.kr/data/15122220/fileData.do

Reproduction

Analysis started2023-12-12 08:24:10.902154
Analysis finished2023-12-12 08:24:11.220262
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct42
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-12T17:24:11.432210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length13.136364
Min length2

Characters and Unicode

Total characters578
Distinct characters144
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)90.9%

Sample

1st row에러 메세지
2nd row승용차요일제 탈퇴신청서
3rd rowcarfree_add
4th row태그자동발급기 드라이버 설치메뉴얼
5th row할인가맹점 지정 약관
ValueCountFrequency (%)
요일제 11
 
9.6%
승용차 9
 
7.8%
승용차요일제 7
 
6.1%
신청서 5
 
4.3%
할인가맹점 4
 
3.5%
탈퇴신청서 4
 
3.5%
방법 3
 
2.6%
부착 2
 
1.7%
태그 2
 
1.7%
rfid 2
 
1.7%
Other values (57) 66
57.4%
2023-12-12T17:24:11.898930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
12.3%
24
 
4.2%
22
 
3.8%
1 21
 
3.6%
21
 
3.6%
19
 
3.3%
19
 
3.3%
18
 
3.1%
2 16
 
2.8%
12
 
2.1%
Other values (134) 335
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 373
64.5%
Decimal Number 76
 
13.1%
Space Separator 71
 
12.3%
Lowercase Letter 26
 
4.5%
Uppercase Letter 19
 
3.3%
Other Punctuation 5
 
0.9%
Dash Punctuation 4
 
0.7%
Connector Punctuation 2
 
0.3%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
6.4%
22
 
5.9%
21
 
5.6%
19
 
5.1%
19
 
5.1%
18
 
4.8%
12
 
3.2%
11
 
2.9%
10
 
2.7%
9
 
2.4%
Other values (95) 208
55.8%
Lowercase Letter
ValueCountFrequency (%)
e 7
26.9%
r 4
15.4%
n 2
 
7.7%
i 2
 
7.7%
d 2
 
7.7%
a 2
 
7.7%
c 1
 
3.8%
f 1
 
3.8%
w 1
 
3.8%
t 1
 
3.8%
Other values (3) 3
11.5%
Decimal Number
ValueCountFrequency (%)
1 21
27.6%
2 16
21.1%
0 11
14.5%
4 8
 
10.5%
5 6
 
7.9%
7 4
 
5.3%
3 4
 
5.3%
6 2
 
2.6%
8 2
 
2.6%
9 2
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
R 4
21.1%
F 3
15.8%
W 2
10.5%
B 2
10.5%
I 2
10.5%
D 2
10.5%
S 2
10.5%
V 1
 
5.3%
E 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
; 2
40.0%
Space Separator
ValueCountFrequency (%)
71
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 373
64.5%
Common 160
27.7%
Latin 45
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
6.4%
22
 
5.9%
21
 
5.6%
19
 
5.1%
19
 
5.1%
18
 
4.8%
12
 
3.2%
11
 
2.9%
10
 
2.7%
9
 
2.4%
Other values (95) 208
55.8%
Latin
ValueCountFrequency (%)
e 7
15.6%
r 4
 
8.9%
R 4
 
8.9%
F 3
 
6.7%
W 2
 
4.4%
B 2
 
4.4%
I 2
 
4.4%
n 2
 
4.4%
D 2
 
4.4%
S 2
 
4.4%
Other values (12) 15
33.3%
Common
ValueCountFrequency (%)
71
44.4%
1 21
 
13.1%
2 16
 
10.0%
0 11
 
6.9%
4 8
 
5.0%
5 6
 
3.8%
7 4
 
2.5%
- 4
 
2.5%
3 4
 
2.5%
. 3
 
1.9%
Other values (7) 12
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 373
64.5%
ASCII 205
35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71
34.6%
1 21
 
10.2%
2 16
 
7.8%
0 11
 
5.4%
4 8
 
3.9%
e 7
 
3.4%
5 6
 
2.9%
r 4
 
2.0%
7 4
 
2.0%
- 4
 
2.0%
Other values (29) 53
25.9%
Hangul
ValueCountFrequency (%)
24
 
6.4%
22
 
5.9%
21
 
5.6%
19
 
5.1%
19
 
5.1%
18
 
4.8%
12
 
3.2%
11
 
2.9%
10
 
2.7%
9
 
2.4%
Other values (95) 208
55.8%

파일확장자
Categorical

Distinct12
Distinct (%)27.3%
Missing0
Missing (%)0.0%
Memory size484.0 B
.hwp
13 
.jpg
.pdf
.xls
.png
Other values (7)

Length

Max length5
Median length4
Mean length4.0681818
Min length4

Unique

Unique5 ?
Unique (%)11.4%

Sample

1st row.jpg
2nd row.pdf
3rd row.png
4th row.pdf
5th row.hwp

Common Values

ValueCountFrequency (%)
.hwp 13
29.5%
.jpg 9
20.5%
.pdf 8
18.2%
.xls 3
 
6.8%
.png 2
 
4.5%
.zip 2
 
4.5%
.JPG 2
 
4.5%
.exe 1
 
2.3%
.PNG 1
 
2.3%
.docx 1
 
2.3%
Other values (2) 2
 
4.5%

Length

2023-12-12T17:24:12.072790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
hwp 13
29.5%
jpg 11
25.0%
pdf 8
18.2%
xls 3
 
6.8%
png 3
 
6.8%
zip 2
 
4.5%
exe 1
 
2.3%
docx 1
 
2.3%
jpeg 1
 
2.3%
xlsx 1
 
2.3%

Correlations

2023-12-12T17:24:12.173667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일명파일확장자
파일명1.0000.729
파일확장자0.7291.000

Missing values

2023-12-12T17:24:11.080673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:24:11.177020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

파일명파일확장자
0에러 메세지.jpg
1승용차요일제 탈퇴신청서.pdf
2carfree_add.png
3태그자동발급기 드라이버 설치메뉴얼.pdf
4할인가맹점 지정 약관.hwp
5할인가맹점 지정 신청서.hwp
6보험협회제공 승용차 요일제 특별약관보험상품 안내3.hwp
7보험할인안내2.hwp
8승용차 요일제 보험료 할인제도3.hwp
9보험협회제공 승용차 요일제 특별약관보험상품 안내2.hwp
파일명파일확장자
34S42BW-418011615590.pdf
35비밀번호.JPG
36탈퇴조회 설정 수정 방법 1부.xlsx
37스티커.jpg
38탈퇴신청서.jpg
39승용차 요일제 참여 신청서.hwp
40승용차요일제 할인가맹점 모집 신청서.hwp
41140225 승용차요일제 가맹점 홍보물.pdf
42140225 승용차 요일제 홍보물.pdf
4312.png